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Preface to the Second Edition 


In 2013 the world mathematical community is celebrating the 100th anniver- 
sary of Paul Erdés’ birth. His personality is remembered by many of his 
friends, former disciples, and over 500 coauthors, and his mathematics is as 
alive and well as if he was still among us. In 1995/1996 we were preparing 
the two volumes of The Mathematics of Paul Erdos not only as a tribute to 
the achievements of one of the great mathematicians of the twentieth century 
but also to display the full scope of his ceuvre, the scientific activity which 
transcends individual disciplines and covers a large part of mathematics as 
we know it today. We did not want to produce just a “festschrift”. 

In 1995/1996 this was a reasonable thing to do since most people were 
aware of the (non-decreasing) Erddés activity only in their own particular area 
of research. For example, we combinatorialists somehow have a tendency to 
forget that the main activity of Erdés was number theory. 

In the busy preparation of the volumes we did not realize that at the end, 
when published, our volumes could be regarded as a tribute, as one of many 
obituaries and personal recollections which flooded the scientific (and even 
mass) media. It had to be so; the old master left. 

Why then do we think that the second edition should be published? 
Well, we believe that the quality of individual contributions in these volumes 
is unique, interesting and already partly historical (and irreplaceable— 
particularly in Part I of the first volume). Thus it should be updated and 
made available especially in this anniversary year. This we feel as our duty 
not only to our colleagues and authors but also to students and younger 
scientists who did not have a chance to meet the wandering scholar personally. 
We decided to prepare a second edition, asked our authors for updates and 
in a few instances we solicited new contributions in exciting new areas. The 
result is then a thoroughly edited volume which differs from the first edition 
in many places. 

On this occasion we would like to thank all our authors for their time and 
work in preparing their articles and, in many cases, modifying and updating 
them. We are fortunate that we could add three new contributions: one by 
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Joel Spencer (in the way of personal introduction), one by Larry Guth in 
Part IV of the first volume devoted to geometry, and one by Alexander 
Razborov in Part I of the second volume devoted to extremal and Ramsey 
problems. We also wish to acknowledge the essential contributions of Steve 
Butler who assisted us during the preparation of this edition. In fact Steve’s 
contributions were so decisive that we decided to add him as co-editor to 
these volumes. We also thank Kaitlin Leach (Springer) for her efficiency and 
support. With her presence at the SIAM Discrete Math. conference in Halifax, 
the whole project became more realistic. 

However, we believe that these volumes deserve a little more contem- 
plative introduction in several respects. The nearly 20 years since the first 
edition was prepared gives us a chance to see the mathematics of Paul 
Erdés in perspective. It is easy to say that his mathematics is alive; that 
may sound cliché. But this is in fact an understatement for it seems that 
Erd6s’ mathematics is flourishing. How much it changed since 1995 when 
the first edition was being prepared. How much it changed in the wealth 
of results, new directions and open problems. Many new important results 
have been obtained since then. To name just a few: the distinct distances 
problem, various bounds for Ramsey numbers, various extremal problems, the 
empty convex 6-gon problem, packing and covering problems, sum-product 
phenomena, geometric incidence problems, etc. Many of these are covered by 
articles of this volumes and many of these results relate directly or indirectly 
to problems, results and conjectures of Erdés. Perhaps it is not as active 
a business any more to solve a particular Erdés problem. After all, the 
remaining unsolved problems from his legacy tend to be the harder ones. 
However, many papers quote his work and in a broader sense can be traced 
to him. 

There may be more than meets the eye here. More and more we see that 
the Erdos problems are attacked and sometimes solved by means of tools that 
are not purely combinatorial or elementary, and which originate in the other 
areas of mathematics. And not only that, these connections and applications 
merge to new theories which are investigated on their own and some of which 
belong to very active areas of contemporary mathematics. As if the hard 
problems inspire the development of new tools which then became a coherent 
group of results that may be called theories. This phenomenon is known to 
most professionals and was nicely described by Tim Gowers as two cultures. 
[W. T. Gowers, The two cultures of mathematics, in Mathematics: Frontiers 
and Perspectives (Amer. Math. Soc., Providence, RI, 2000), 65—78.] On one 
side, problem solvers, on the other side, theory builders. Erdés’ mathematics 
seems to be on one side. But perhaps this is misleading. As an example, 
see the article in the first volume Unexpected applications of polynomials 
in combinatorics by Larry Guth and the article in the second volume Flag 
algebras: an interim report by Alexander Razborov for a wealth of theory 
and structural richness. Perhaps, on the top level of selecting problems and 
with persistent activity in solving them, the difference between the two sides 
becomes less clear. (Good) mathematics presents a whole. 
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Time will tell. Perhaps one day we shall see Paul Erdés not as a theory 
builder but as a man whose problems inspired a wealth of theories. 

People outside of mathematics might think of our field as a collection 
of old tricks. The second edition of mathematics of Paul Erdés is a good 
opportunity to see how wrong this popular perception of mathematics is. 


La Jolla, USA R.L. Graham 
Prague, Czech Republic J. NeSetril 
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IN MEMORIAM 


Paul Erdos 


26.3.1913-20.9.1996 


The week before these volumes were scheduled to go to press, we learned that 
Paul Erdos died on September 20, 1996. He was 83. Paul died while attending 
a conference in Warsaw, on his way to another meeting. In this respect, this 
is the way he wanted to “leave”. In fact, the list of his last month’s activities 
alone inspires envy in much younger people. 

Paul was present when the completion of this project was celebrated 
by an elegant dinner in Budapest for some of the authors, editors and 
Springer representatives attending the European Mathematical Congress. He 
was especially pleased to see the first copies of these volumes and was perhaps 
surprised (as were the editors) by the actual size and impact of the collection 
(On the opposite page is the collection of signatures from those present at 
the dinner, taken from the inside cover of the mock-up for these volumes). 
We hope that these volumes will provide a source of inspiration as well as a 
last tribute to one of the great mathematicians of our time. And because of 
the unique lifestyle of Paul Erd6s, a style which did not distinguish between 
life and mathematics, this is perhaps a unique document of our times as well. 


R.L. Graham 
J. Negetiil 


ix 
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In 1992, when Paul Erdés was awarded a Doctor Honoris Causa by Charles 
University in Prague, a small conference was held, bringing together a distin- 
guished group of researchers with interests spanning a variety of fields related 
to Erdés’ own work. At that gathering, the idea occurred to several of us 
that it might be quite appropriate at this point in Erdés’ career to solicit a 
collection of articles illustrating various aspects of Erdés’ mathematical life 
and work. The response to our solicitation was immediate and overwhelming, 
and these volumes are the result. 

Regarding the organization, we found it convenient to arrange the papers 
into six chapters, each mirroring Erdés’ holistic approach to mathematics. 
Our goal was not merely a (random) collection of papers but rather a 
thoroughly edited volume composed in large part by articles explicitly 
solicited to illustrate interesting aspects of Erdés and his life and work. 
Each chapter includes an introduction which often presents a sample of 
related Erdés’ problems “in his own words”. All these (sometimes lengthy) 
introductions were written jointly by editors. 

We wish to thank the nearly 70 contributors for their outstanding efforts 
(and their patience). In particular, we are grateful to Béla Bollobas for his 
extensive documentation of Paul Erd6s’ early years and mathematical high 
points; our other authors are acknowledged in their respective chapters. We 
also want to thank A. Bondy, G. Hahn, I. Ouhel, K. Marx, J. Nateradsky 
and Ché Graham for their help and for the use of their works. At various 
stages of the project, the book was supported by AT&T Bell Laboratories, 
GACR 2167 and GAUK 351. We also are indebted to Dr. Joachim Heinze 
and Springer Verlag for their encouragement and support. Finally, we would 
like to record our extreme debt to Susan Pope (at AT&T Bell Laboratories) 
who somehow (miraculously) managed to convert more than 50 manuscripts 
of all types into the attractive form they now have. 

Here then is a unique portrait of a man who has devoted his whole being 
to “proving and conjecturing” and to the pursuit of mathematical knowledge 
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and understanding. We hope that this will form a lasting tribute to one of 
the great mathematicians of our time. 


Murray Hill, USA R.L. Graham 
Praha, Czech Republic J. NeSetril 
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Paul Erdés on the Queen Elizabeth. ey EAE : 
Photo by Ronald Taft. Paul Erdés in the mountains. 


xiv Illustrations 


eee ba ms Bes = ze 
Paul Erdés with Béla Bollobas. Photo by George Csicsery. 
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Paul Erdés in 1941. 


Paul Erdés. Photo by George Csicsery. 


Paul Erdés in the 1950s. Paul Erdés with Leo Moser. 
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Béla Bollobas 


B. Bollobds (4) 
Department of Pure Mathematics and Mathematical Statistics, University 
of Cambridge, 16 Mill Lane, Cambridge, CB2 1SB, England 


Trinity College, Cambridge, CB2 1TQ, England, UK 


Department of Mathematical Sciences, University of Memphis, 
Memphis, TN 38152, USA 
e-mail: B.Bollobas@dpmms.cam.ac.uk 


Dipping into the mathematical papers of Paul Erdés is like wandering into 
Aladdin’s Cave. The beauty, the variety and the sheer wealth of all that 
one finds is quite overwhelming. There are fundamental papers on number 
theory, probability theory, real analysis, approximation theory, geometry, 
set theory and, especially, combinatorics. These great contributions to 
mathematics span over six decades; Erdés and his collaborators have left 
an indelible mark on the mathematics of the twentieth-century. The areas of 
probabilistic number theory, partition calculus for infinite cardinals, extremal 
combinatorics, and the theory of random graphs have all practically been 
created by Erdés, and no-one has done more to develop and promote the use 
of probabilistic methods throughout mathematics. 

Erdos is the mathematician par excellence: he thrives on mathematics, 
living in a state of continuous excitement; he raises, answers and commu- 
nicates questions, picking up the problems of others and making incisive 
contributions to them with lightning speed. 

Considering what a mild-mannered man he is, it is surprising that 
everything about Erdés and his mathematics is extreme. He has written 
over 1,400 papers, more than any mathematician since Euler, and has more 
than 400 coauthors. If the Guinness Book of Records had categories related 
to mathematical activities, Paul Erdés would hold many of the records by 
a margin one could not even attempt to estimate, like the thousands of 
problems posed, the millions of miles travelled, the tens of thousands of 
mathematical discussions held, the thousands of different beds slept in, the 
thousands of lectures delivered at different universities, the hundreds of 
mathematicians helped, and so on. 

Today we live in the age of big mathematical theories, bringing together 
many sophisticated branches of mathematics. These powerful theories can 
be very successful in solving down-to-earth problems, as in the case of 
Andrew Wiles’s wonderful proof of Fermat’s Last Theorem. But no matter 
how important and valuable these big theories are, they cannot constitute 
all of mathematics. There are a remarkable number of basic mathematical 
questions that we would love to answer (nay, we should answer!) which seem 
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to withstand all our assaults. There is a danger that we turn our backs on 
such questions, persuading ourselves that they are not interesting when in 
fact we mean only that we cannot tackle them with our favourite theories. Of 
course, such an attitude would not be in the proper spirit of science; surely, 
we should say that we do want to answer these questions, by whatever means. 
And if there are no theories to help us, no bulldozers to move the earth, then 
we must rely on our bare hands and ingenuity. It is not that we do not want 
to use big theories to crack our problems, but that the big theories around 
are unable to say anything deep about our questions. And, with luck, our 
hands-on approach will tie up with available theories or, better still, will lead 
to new, more sensitive theories. 

Ernst Straus, who as a young man was Einstein’s assistant, reported 
that the reason why Einstein had chosen physics over mathematics was 
that mathematics was so full of beautiful and attractive questions that one 
might easily waste one’s life working on the “wrong” questions. Einstein was 
confident that in physics he could identify the “central” questions, and he 
felt that it was the duty of a scientist to pursue these questions and not let 
himself be seduced by any problem-no matter how difficult or attractive it 
might be. 

The philosophy of Erdés has been completely different. Throughout 
his long career, he has been happy to pursue the beautiful problems he 
encountered, and has raised many others. But this is not an ad hoc 
process: Erdés has an amazing instinct for discerning beautiful problems 
that, while appearing innocuous, in fact go right to the heart of the matter. 
These problems are not chosen indiscriminately; they frequently lead to the 
discovery of unexpected and exciting phenomena. Like Ramanujan, Erdés 
uses particular instances of problems to explore an area. Rather than taking 
whole countries in one sweeping move, he prefers first to occupy some nearby 
castles, from which he can weigh up the unknown territory before making his 
next move. 

For over 60 years now, Erdés has been the world’s most celebrated 
problem solver and problem poser. Unrivalled, king, nonpareil, .... He has 
been called an occidental Ramanujan, a modern-day Euler, the Mozart of 
mathematics. These glowing epithets accurately capture the different facets 
of Paul Erdés—each is correct in its own way. He has a unique talent to pose 
penetrating questions. It is easy to ask questions that lead nowhere, questions 
that are either impossibly hard or too easy. It is a completely different matter 
to raise, as Erdés does, innocent-looking problems whose solutions shed light 
on the shape of the mathematical landscape. 

An important feature of the problems posed by Erdés is that they 
carry differing monetary rewards. Needless to say, this is done in jest, but 
the prizes do indicate Erdés’s assessment of the difficulty of the problems. 
How different this is from the annoying habit of some mathematicians, who 
casually mention a problem as if they hadn’t even thought about it, when in 
fact they are telling you the central problem they have been working on for 
a long time! 
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Two features of his mathematical ceuvre stand out: his mastery of 
elementary methods and his advocacy of random methods. Starting with his 
very first papers, Erdés championed elementary methods in diverse branches 
of mathematics. He showed, again and again, that elementary methods often 
succeed against overwhelming odds. In many brilliant proofs he showed 
that, rather than bringing somewhat foreign machinery to bear on some 
problems, and thereby trying to fit a square peg into a round hole, one can 
progress considerably further by facing the complications, going deep into the 
problem, and tailoring our approach to the intrinsic difficulties of the problem. 
This philosophy can pay unexpected dividends, as shown by Charles Read’s 
solution of the Invariant Subspace Problem, Miklés Laczkovich’s solution of 
Tarski’s problem of “Squaring the Circle”, and Tim Gowers’ recent solutions 
of Banach’s last unsolved problems, including the Hyperplane Problem. 

As to probabilistic methods, by now it is widely acknowledged that 
these can be remarkably effective in tackling main-line questions in diverse 
areas of mathematics that have nothing to do with probability. It is worth 
remembering, though, that when Erd6s started it all, the idea was very 
startling indeed. That today we take it in our stride is a sign of the tremendous 
success of the random method, which is very much his method, still frequently 
called the Erdés method. 

Paul Erdés was born on 26th March 1913, in Budapest. His parents were 
teachers of mathematics and physics; his father translated a book on aircraft 
design from English into Hungarian. The young Paul did not go to elementary 
school, but was brought up by his devoted mother, Anna, and, for 3 years, 
between the ages of 3 and 6, he had a German Frdulein. His exceptional 
talent for mathematics was evident by the time he was 3: his agility at mental 
arithmetic impressed all comers, and he was not yet 4 when he discovered 
negative numbers for himself. With the outbreak of the First World War, 
his father was drafted into the Austro-Hungarian army, and served on the 
Eastern Front. He was taken prisoner by the Russians, and sent to Siberia to 
a prisoner of war camp, from which he returned only after about 6 years. 

After the unconditional surrender of Hungary at the end of the War, the 
elected government resigned, as it could not accept the terms of the Allies. 
These terms left Hungary only the rump of her territory, and in March 1919 
the communists took over the country, with the explicit aim of repelling 
the Allies. The communists formed a Dictatorship of the Proletariat, usually 
referred to as the Commune, after its French equivalent in 1871, and set about 
defending the territory and forcibly reforming the social order. 

The Commune could not resist the invasion by the Allies and the 
Hungarian “white” officers under Admiral Horlhy, and it fell after a struggle 
of 3 months. Unfortunately for the Erdés family, Anna Erdés had a minor 
post under the Commune, and when Horthy came to power, she lost her job, 
never to teach again. Later she worked as a technical editor. 

He studied elementary school privately with his mother. After that, in 
1922, the young Erd6ds went to Tavaszmez6 gymnasium, the first year as a 
private pupil, the second and third years as a normal student, and the fourth 


4 Béla Bollobas 


year again as a private pupil. After the fourth year he attended St. Stephen’s 
School (Szent Istvan Gimnazium) where his father was a high school teacher. 
At this time Erdés also received significant instruction from his parents as 
well. As it happened, my father entered the school just as Erdés left it, so 
they share many classmates, although they met only many years later. 

By the early 1920s the Mathematical Journal for Secondary Schools 
(K6zépiskolai Matematikai Lapok) was a successful journal, catering for 
pupils with talent for mathematics. The journal had been founded in 1895 
by a visionary young man, Daniel Arany, who hoped to raise the level of 
mathematics in the whole of Hungary by enticing students to mathematics 
through beautiful problems. The backbone of the journal was the year-long 
competition. Every month a number of problems were set for each age group; 
the readers were invited to submit their solutions, which were marked, and 
the best published under the names of the authors. 

The young Erdés became an ardent reader of this journal, and his love 
of mathematics was greatly fanned by the intriguing problems in it. In some 
sense, Erdés’s earliest publications date to this time, with the appearance of 
his solutions in the journal. On one occasion Paul Erdés and Paul Turan were 
the only ones who managed to solve a particular problem, and their solution 
was published under their joint names. This was Erdés’s first “joint paper” 
with Turan, whom he had not even met at the time, and who later became 
one of his closest friends and most important collaborators. 

Mathematicians, and especially young mathematicians, learn much from 
each other. Erdés was very lucky in this respect, for when at the age of 17 
he entered the Pazmany Péter Tudomdnyegyetem (the science university of 
Budapest) he found there an excellent group of about a dozen youngsters 
devoted to mathematics. Not surprisingly, Erdés became the focal point of 
this group, but the long mathematical discussions stimulated him greatly. 

This little group included Paul Turan, the outstanding number theorist; 
Tibor Gallai, the excellent combinatorialist; Dezs6 Lazar, who was later 
tragically killed by the Nazis; George Szekeres and Esther Klein, who later 
married and subsequently emigrated to Australia; Laszlé Alpdr, who became 
an important member of the Hungarian Mathematical Institute; Marta Svéd, 
another member of the group who went to live in Australia; and several 
others. Not only did they discuss mathematics at the university, but also in 
the afternoons and evenings, when they used to meet at various public places, 
especially by the Statue of Anonymous, commemorating the first chronicler 
of Hungarian history. 

Two of Erd6és’s professors stand out: Lipdt Fejér, the great analyst, and 
Dénes Konig, who introduced Erdos to graph theory. The lectures of Konig led 
to the first results of Erdds in graph theory: in answer to a question posed in 
the lectures, in 1931 he extended Menger’s theorem to infinite graphs. Erdés 
never published his proof, but it was reproduced in Ko6nig’s classic, published 
in 1936. 
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As an undergraduate, Erdés worked mostly on number theory, obtaining 
several substantial results. He was not even 20, when in Berlin the great Issai 
Schur lectured on Erdés’s new proof of Bertrand’s postulate. He wrote his 
doctoral dissertation as a second year undergraduate, and it was not long 
before he got into correspondence with several mathematicians in England, 
including Louis Mordell, the great number theorist in Manchester, and 
Richard Rado and Harold Davenport in Cambridge. All three became Erdés’s 
close friends. 

When in 1934 Erddés finished university, he accepted Mordell’s invitation 
to Manchester. He left Hungary for England in the autumn of 1934, not 
knowing that he would never again live in Hungary permanently. On Ist 
October 1934 he was met at the railway station in Cambridge by Davenport 
and Rado, who took him to Trinity College, and they immediately embarked 
on the first of their many long mathematical discussions. Next day Erd6ds met 
Hardy and Littlewood, the giants of English mathematics, before hurrying 
on to Mordell. 

Mordell put together an amazing group of mathematicians in Manchester, 
and Erdés was delighted to join them. First he took up the Bishop Harvey 
Goodwin Fellowship, and was later awarded a Royal Society Fellowship. He 
was free to do research under Mordell’s guidance, and he was soon producing 
papers with astonishing rapidity. In 1937 Davenport left Cambridge to join 
Mordell and Erdos, and their life-long friendship was soon cemented. I have a 
special reason to be grateful for the Erdés-Davenport friendship: many years 
later, I was directed to my present home, Trinity College, Cambridge, only 
because Davenport was a Fellow here, and he was a good friend of Erdos. 

In 1938 Erdés was offered a fellowship at the Institute for Advanced Study 
in Princeton, so he soon thereafter sailed for the U.S., where he was to spend 
the next decade. The war years were rather hard on Erdos, as it was not 
easy to hear from his parents in Budapest, and when he received news, it was 
never good. His father died in August 1942, his mother later had to move 
to the Ghetto in Budapest, and his grandmother died in 1944. Many of his 
relatives were murdered by the Nazis. 

In spite of being cut off from his home, Erdds continued to pour forth 
wonderful mathematics at a prodigious rate. Having arrived in America, he 
spent a year and a half at Princeton, before starting on his travels. He visited 
Philadelphia, Purdue, Notre Dame, Stanford, Syracuse, Johns Hopkins, to 
mention but a few places, and the pattern was set: like a Wandering Scholar 
of the Middle Ages, Erdés never stopped again. In addition to the many 
important papers he wrote by himself, he collaborated more and more with 
mathematicians from diverse areas, writing outstanding joint papers with 
Mark Kac, Wintner, Kai Lai Chung, Ivan Niven, Arye Dvoretzky, Shizuo 
Kakutani, Arthur A. Stone, Leon Alaoglu, Irving Kaplansky, Alfred Tarski, 
Gabor Szegé, William Feller, Fritz Herzog, George Piranian, and others. 
Through correspondence, he continued his collaboration with Paul Turan, 
Harold Davenport, Chao Ko and Tibor Gallai (Griimwald). 
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In 1954, he left the U.S. to attend the International Congress of 
Mathematicians in Amsterdam. He had also asked for a reentry permit at 
that time but his request was denied. So he left without a reentry permit 
since in his own words, “Neither sam nor joe can restrict my right to travel.” 
Left without a country, Israel came to his aid, offering him employment at the 
Hebrew University in Jerusalem, and a passport. He arrived in Israel on 30th 
November 1954, and from then on he has been to Israel practically every year. 
Before leaving Israel for Europe in July 1955, he applied for a return visa to 
Israel. When the officials asked him whether he wanted to become an Israeli 
citizen, he politely refused, saying that he did not believe in citizenship. 

After the upheaval following his trip to Amsterdam, he first returned to 
the U.S. in 1959; the relationship between Erdés and the U.S. Immigration 
Department was finally normalized in 1963, and since then he has had no 
problems with them. 

In the Treaty of Yalta, Hungary was placed within the Soviet sphere of 
influence; the communists, aided by the Russians, took over the government, 
and turned Hungary into a People’s Republic. For ordinary Hungarians, 
leaving Hungary even for short trips to the West became very difficult. 
Nevertheless, in 1955 Erdos managed to return to Hungary for a short 
time, when his good friend, George Alexits, pulled strings and convinced 
the officials that, if Erddés were to enter the country, he should be allowed to 
leave. 

Later Erdés could return to Hungary at frequent intervals, in order to 
spend more and more time with his mother, as well as to collaborate with 
a large number of Hungarian mathematicians, especially Turan and Rényi, 
In those dark days, Erdés was the main link between many Hungarian 
mathematicians and the West. 

As a young pupil, I first heard him lecture during one of his visits: not 
only did he talk about fascinating problems but he also cut a flamboyant 
figure, with his suntan, Western suit and casual mention of countries I was 
sure I could never visit. I got to know him during his next visit: in 1958, 
having won the National Competition, I was summoned to the elegant hotel 
he stayed in with his mother. They could not have been kinder: Erdés told 
to me a host of intriguing questions, and did not talk down to me, while his 
mother (whom, as most of their friends, I learned to call Annus Néni or Aunt 
Anna) treated me to cakes, ice cream and drinks. Three years later they got 
to know my parents, and from then on they were frequent visitors to our 
house, especially for Sunday lunches. My father, who was a physician, looked 
after both Erdés and Annus Néni. 

Seeing them together, there was no doubt that they were very happy 
in each other’s company: these were blissful days for both of them. Erdés 
thoroughly enjoyed being with his mother, and she was delighted to have 
her son back for a while. They looked after each other lovingly; each worried 
whether the other ate well and slept enough or, perhaps, was a little tired. 
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Annus Néni was fiercely proud of her wonderful son, loved to see the many 
signs that her son was a great mathematician, and revelled in her role as 
the Queen Mother of Mathematics, surrounded by all the admirers and well- 
wishers. She was never far from Erdés’s mathematics either: she kept Erdés’s 
hundreds of reprints in perfect order, sending people copies on demand. 

Annus Néni was not young, having been born in 1880, but her health was 
good and she was very sharp. To compensate for the many years when they 
had been kept apart, Annus Néni started to travel with her son in her 80s; 
their first trip together being to Israel in November 1964. From then on they 
travelled much together: to England in 1965, many times to other European 
countries and the U.S., and towards the end of 1968 to Australia and Hawaii. 
When, tinged with envy, we told her that it must be wonderful to see the 
world, she replied “You know that I don’t travel because I like it but to be 
with my son.” It was a tragedy for Erdés when, in 1971, Annus Néni died 
during a trip to Calgary. Her death devastated him and for years afterwards 
he was not quite himself. He still hasn’t recovered from the blow, and it is 
most unlikely that he ever will. 

Erdés’s brushes with officialdom were not quite over: the communists 
also managed to upset him. In 1973 there was an international meeting in 
Hungary, to celebrate his 60th birthday. Erdos’s friends from Israel were 
denied a visa to enter Hungary; this outraged him so much that for 3 years 
he did not return to Hungary. 

With the collapse of communism and with the end of the Cold War, Erdés 
has entered a golden age of travel: not only can he go freely wherever he wants 
to, but he is even welcomed by officials everywhere. 

Having started as a mathematical prodigy, by now Erdos is the doyen 
of mathematicians, with more friends in mathematics than the number of 
people most of us meet in a lifetime. As he likes to put it in his inimitable 
way, he has progressed from prodigy to dotigy. As a Member of the Hungarian 
Academy of Sciences, Erdés has a permanent position in Budapest. During 
summer months, he frequently stays in the Guest House of the Academy, 
two doors away from my mother, visiting Vera Sés, Andras Hajnal, Miklés 
Simonovits, Andras Sarkézy, Miklé6s Laczkovich, and inspiring many others. 
Another permanent position awaits him in Memphis, where he stays and 
works with Ralph Faudree, and his other friends, Dick Schelp and Cecil 
Rousseau. In Israel he visits all the universities, including the Technion in 
Haifa, Tel Aviv, Jerusalem and the Weizman Institute. But for years now, 
Erdés has had many other permanent ports of call, including Kalamazoo, 
where Yousef Alavi looks after him; New Jersey and the New York area, 
where he stays with Ron Graham and Fan Chung and talks to many others 
as well, including Janos Pach, Joel Spencer, Mel Nathanson, Peter Winkler, 
Endre Szemerédi, Joseph Beck and Herb Wilf; Calgary, mostly because of 
Eric Milner, Richard Guy and Norbert Sauer; Atlanta, with Dick Duke, 
Vojtéch Rédl, Ron Gould and Dwight Duffus. And the list could go on and 
on, with Athens, Baton Rouge, Berlin, Bielefeld, Boca Raton, Bonn, Boston, 
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Cambridge, Chicago, Los Angeles, Lyon, Minneapolis, Paris, Poznan, Prague, 
Urbana, Warsaw, Waterloo, and many others. 

Honours have been heaped upon Erdés, although he could not care less. 
Every fifth year there is an International Conference in Cambridge on his 
birthday, and in 1991 Cambridge also awarded him a prestigious Honorary 
Doctorate, as did the Charles University of Prague a year later, and many 
other universities since. On the occasion of his 80th birthday, he was honoured 
at a spate of conferences, not only in Cambridge, but also in Kalamazoo, Boca 
Raton, Prague and Keszthély. 

Nowadays Erdés lectures in more places than ever, interspersing his 
mathematical problems with stories about mathematicians and his remarks 
about life. He dislikes cold but, above all, hates old age and stupidity, and 
so he appreciates the languages in which these evils sound similar. Thus, old 
and cold and alt and kalt go hand in hand in English and German, and in no 
other language he knows. But Hindi is better still because the two greatest 
evils sound almost the same: buddha is old and budu is stupid. 

Erdés is fond of paraphrasing poems, especially Hungarian poems, to 
illustrate various points. The great Hungarian poet at the beginning of this 
century, Endre Ady, Wrote: Legyen dtkozott aki a helyembe dll! (Let him be 
cursed who takes my place!) As a mathematician builds the work of others, so 
that his immortality depends on those who continue his work, Erdés professes 
the opposite: Let him be blessed who takes my place! 

But Erdés does not wait for posterity to find people to continue his work: 
his extraordinary number of collaborators ensures that many people carryon 
his work all around the world. The collaborators who particularly stand out 
are Paul Turan, Harold Davenport, Richard Rado, Mark Kac, Alfréd Rényi, 
Andras Hajnal, Andras Sarkézy, Vera Sds and Ron Graham: they have all 
done much major work with Erd6és. In a moment we shall see a brief account 
of some of this work. Needless to say, our review of Erdés’s mathematics 
will be woefully brief and inadequate, and will also reflect the taste of the 
reviewer. 

Erdés wrote his first paper as a first-year undergraduate, on Bertrand’s 
postulate that, for every n > 1, there is a prime p satisfyingn < p < 
2n. Bertrand’s postulate was first proved by Chebyshev, but the original 
proof was rather involved, and in 1919 Ramanujan gave a considerably 
simpler proof of it. In his fundamental book, Vorlesungen tiber Zahlentheorie, 
published in Leipzig in 1927, Landau gave a rather simple proof of the 
assertion that for some q > 1 and every n > 1, there is a prime between 
n and qn. However, Landau’s g could not be taken to be 2. In his first paper, 
Erdos sharpened Landau’s argument, and by studying the prime factors of the 
binomial coefficient (Cy, gave a simple and elementary proof of Bertrand’s 
postulate. 

Erd6és was quick to develop further the ideas in his first paper. In 1932, 
Breusch made use of L-functions to generalize Bertrand’s postulate to the 
arithmetic progressions 3n+1, 3n+2, 4n+1 and 4n+3: for every m > 7 there 
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are primes of the form 8n+1, 8n+2, 4n+1 and 4n+3 between m and 2m. 
By constructing expressions containing, as factors, all terms of the arithmetic 
progression at hand, and rather few other factors, Erdés managed to give an 
elementary proof of Breusch’s theorem, together with various extensions of it 
to other arithmetic progressions. These results constituted the Ph.D. thesis 
Erdés wrote as a second-year undergraduate, and published in Saérospatak in 
1934. 

Schur, who had been Breusch’s supervisor in Berlin, was quick to 
recognize the genius of the author of the beautiful elementary proof of 
Breusch’s theorem. When, a little later, Erdés proved a conjecture of Schur 
on abundant numbers, and solved another problem of Schur, Erdés became 
“der Zauberer von Budapest” (“the magician of Budapest” )—no small praise 
from the great German for a young man of 20. 

Abundant numbers figured prominently among the early problems tackled 
by Erdés. In his lectures on number theory, Schur conjectured that the 
abundant numbers have positive density: limz_,.. A(x)/zx exists, where A(z) 
is the number of abundant numbers not exceeding xz. (A natural number 
n is abundant if o(n), the sum of its positive divisors, is at least 2n.) The 
beautiful elementary proof Erdés gave of this conjecture led him straight 
to other problems concerning the distribution of the values of real-valued 
additive arithmetical functions f(n), that is functions f : N > R satisfying 
f(ab) = f(a) + f(b) whenever (a,b) = 1. 

These problems were first investigated by Hardy and Ramanujan in 1917, 
but were more or less forgotten for over a decade. As eventually proved by 
Erdés and Wintner in 1939, a real-valued additive arithmetical function f(n) 
behaves rather well if the following three series are convergent: 


S” i/p, => fe)/p and SO fp)? /p, 


If(p)|>1 If(p)|S1 If(p)|S1 


with the summations over primes p. To be precise, the three series above are 
convergent if and only if lim;_,.. A(x)/a exists for every real c, where A,(x) 
stands for the number of natural numbers n < x with f(n) <c. 

In 1934, Turan gave a marvelous proof of an extension of the Hardy- 
Ramanujan theorem on the “typical number of divisors” of a natural number. 
Writing v(n) for the number of distinct prime factors of n (so that v(12) = 2), 
Turan proved that 


N 
S{v(n) —loglogn}? = N loglog N + o(N log log N). 
n=1 


It is a little disappointing that Hardy, one of the greatest mathematicians 
alive, failed to recognize the immense significance of this new proof. Erdés, 
on the other hand, not only saw the significance of the paper, but was quick 
to make use of the probabilistic approach and so became instrumental in 
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the birth of a very fruitful new branch of mathematics, probabilistic number 
theory. In a ground-breaking joint paper he wrote with Kac in 1939, Erdés 
proved that if a bounded real-valued arithmetical function f(n) satisfies 
ee f (p)?/p = 00 then, for every x € R, 


lim_ Ap(m)/m = (2), 


where A,(m) is the number of positive integers n < m satisfying 


1/2 
fny< So flo)fo+ e( S- F(0)*/p) 5 


pom pom 


and, as usual 


1 = 2 
P(x) = =z / e! /2 dt 


is the standard normal distribution. In other words, the arithmetical function 
f(n) satisfies the Gaussian law of error! It took the mathematical community 
quite a while to appreciate the significance and potential of results of this 
type. 

Note that for v(n), the number of prime factors of n, the Erdés-Kac 
theorem says that if x € R is fixed then 


1 
lim —|{n < mand y(n) < loglogm + a(log log m)'/?}| = G(x). 


moo m 


Starting with his very first papers, Erdos championed “elementary” meth- 
ods in number theory. That the number theorists in the 1930s appreciated 
elementary methods was due, to some extent, to Shnirelman’s great success in 
studying integer sequences, with a view of attacking, perhaps, the Goldbach 
conjecture. To study integer sequences, Shnirelman introduced a density, now 
bearing his name: an integer sequence 0 < a,,< ag < ... is said to have 
Shnirelman density a if 


1 
we 
Thus if a; > 1 then the Shnirelman density of the sequence (a,,)°, is 0. 

Khintchine discovered the rather surprising fact that if (a,)°°, is an 
integer sequence of Shnirelman density a with 0 <a < 1, and (b,)°2, is the 
sequence of squares 07,17, 27,..., then the “sum-sequence” (ay + b)) has 
Shnirelman density strictly greater than a. The original proof of this result, 
although elementary, was rather involved. 

When Landau lectured on Khintchine’s theorem in 1935 in Cambridge, 
he presented a somewhat simplified proof he had found with Buchstab. 
Nevertheless, talking to Landau after his lecture, Erdés expressed his view 
that the proof should be considerably simpler and, to Landau’s astonishment, 
as soon as the next day he came up with a “proper” proof that was both 
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elementary and short. In addition, the new proof also made it clear what the 
result had to do with squares: all one needs is that every positive integer is 
the sum of at most four squares. If (b,,) is such that every positive integer 
is the sum of at most & terms b,, then the sum-sequence (a, + by) has 
Shnirelman density at least a + a(1 — a)/2k. It says much about Landau, 
that he immediately included this beautiful theorem of Erdés into the 
Cambridge “Tract” he was writing at the time (Neue Ergebnisse der additiven 
Zahlentheorie, published in 1937). 

The difference between consecutive primes has attracted much attention. 
Writing p,, for the nth prime, the twin prime conjecture states that Pn+1—Dn 
is infinitely often equal to 2, that is liminfy,..(pn41 — pn) = 2. At 
the moment we seem to be very far from a proof of this conjecture; in fact, 
there seems to be no hope to prove that liminfy+.(pn41 — Pn) < oo. The 
Prime Number Theorem, asserting that 1(x) ~ x/logx, where (x) is the 
number of primes p < x, implies that c = lim infy-.o0o(Pn41 —Pn)/logpn < 1, 
but Erdés was the first to prove, in 1940, that c < 1. Later Rankin showed 
that c < 59/60, and then Selberg that c < 15/16. Subsequent improvements 
were obtained by Bombieri and Davenport and by Huxley; the present record, 
c < 0.248, is held by Maier. 

Independently, Erdés and Ricci showed that the set of limit points of 
the sequence (pn41 — Pn)/logpn has positive Lebesgue density and yet, no 
number is known to be a limit point. 

Concerning large gaps between consecutive primes, Backlund proved in 
1929 that limsup,,_,.,(Pn+41 — Pn)/logpyn > 2. In quick succession, this was 
improved by Brauer and Zeitz (1930), by Westzynthius (1931), and then by 
Ricci (1934), to 


li ; Pn+1—Pn 

im sup 

n>co log py log log log p, 

By making use of the method of Brauer and Zeitz, Erdés proved in 1934 that 


‘ (Pn+1—Pn) (log log log pn)? 
im sup 
ates log pn log log pn 


In 1938 this result was improved by Rankin, who smuggled a factor 
log log log log pp, into the denominator: there is a c > 0 such that 


> 0. 


log pn log log pn log log log log pn (1) 
(log log log pn)? 
for infinitely many values of n. It seems to be extremely difficult to improve 
this result, to the extent that Erdés is offering (according to him, perhaps a 
little rashly) $10,000 for a proof that (1) holds for every c. The original value 
of c given by Rankin was improved by Maier and Pomerance in 1990. 
Although in the 1930s elementary methods were spectacularly successful 
in additive number theory and in the study of additive arithmetical functions, 
they did not seem to be suitable for the study of the distribution of primes. 


Pn+1— Pn > € 
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It was not only a desire for diverse proofs that urged mathematicians to search 
for elementary proofs of results proved by deep analytical methods: many 
mathematicians, including Hardy, felt that if the Prime Number Theorem 
(PNT) could be proved by elementary methods then the Riemann Hypothesis 
itself might yield to a similar attack. This belief was reinforced by the result 
of Norbert Wiener in 1930 that the prime number theorem is equivalent to 
the fact that the zeta function ¢(s) = ¢(a + tt) has no zero on the line o = 1. 

Next to the PNT, Dirichlet’s classical theorem on primes in an arithmeti- 
cal progression, proved in 1837, was a test case for the power of elementary 
methods. In 1948 Atle Selberg found an ingenious elementary proof of 
Dirichlet’s theorem; indeed, Selberg proved that if & and / are relatively 
prime numbers then 


1 
lim inf S- p ‘logp>0. 


£00 log x 


p<«x;p=l(mod k) 


Shortly after this, Selberg proved the following fundamental formula: 
So (logp?) + > log plogq = 2x log x + O(c), (2) 


psx pqgax 


where p and gq run over primes. This formula is an easy consequence of the 
PNT, but what caused the excitement was that Selberg gave a completely 
elementary proof. Thus the fundamental formula could be a starting point 
for elementary proofs of various theorems in number theory which previously 
seemed inaccessible by elementary methods. 

Using Selberg’s fundamental formula, Erdés quickly proved that p,+1/ 
Pn > 1 as n— oo, where p, is, as before, the nth prime. Even more, Erdés 
proved (in an entirely elementary way) that if c > 1 then 


(m(cx) — r(x)) > 0. (3) 


Erdés communicated this proof of (3) to Selberg, who, 2 days later, using 
(2), (3) and the ideas in the proof of (3), deduced the PNT limy-+.. w(e) loge 
= 1. Thus an elementary proof of the PNT was found! 

A little later Selberg found it possible to argue directly from (2), without 
making any use of (3); this is the way he wrote up his paper in the autumn of 
1948. In a separate paper, Erdés stated (2), referred to Selberg’s final proof 
of the PNT (not published at the time), gave his own proof of (3), Selberg’s 
deduction of the PNT from (2) and (3), and a joint simplified deduction 
of the PNT from (2). In the Mathematical Reviews the great Cambridge 
mathematician A.E. Ingham found it convenient to review these two papers 
together. As he wrote, “All previous proofs have been by ‘transcendental’ 
arguments involving some appeal to the theory of functions of a complex 
variable. Successive proofs have moderated the demands on this theory, or 
invoked alternative analytical theories (e.g., Fourier transforms), but there 
remained a nucleus of complex variable theory, namely the proposition that 


l 
lim inf eet 
noo Lz 
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Riemann zeta-function ¢(s) = ¢(a + it) has no zeros on the line o = 1; and 
this could hardly be avoided, except by a radically new approach, since the 
PNT is in a clearly definable sense ‘equivalent’ to this property of ¢(s). It has 
long been recognized that an ‘elementary’ proof of the PNT, not depending 
on analytical ideas remote from the problem itself, would (if indeed possible) 
constitute a discovery of the first importance for the logical structure of the 
theory of the distribution of primes. An elementary (though not easy) proof 
is given, in various forms, in these two papers”. 

“In principle, [the papers] open up the possibility of a new approach, in 
which the old logical arrangement is reversed and analytical properties of ¢(s) 
are deduced from arithmetical properties of the sequence of primes. How far 
the practical effects of this revolution of ideas penetrate into the structure of 
the subject, and how much of the theory will ultimately have to be rewritten, 
it is too early to say.” 

For the startling elementary proof of the Prime Number Theorem Selberg 
was awarded a Fields Medal, and Erdés a Cole Prize, given every fourth year 
to the author of the best paper in algebra and number theory published in 
an American journal. 

Let us say a few words about the contributions of Erdés to asymptotic 
formulae. One of the glorious achievements of the Hardy-Ramanujan partner- 
ship was the striking formula for p(n), the number of different partitions of n 
(ignoring the order of the summands). By using powerful analytic methods 
that eventually led to the celebrated circle method of Hardy and Littlewood, 
in 1918 Hardy and Ramanujan gave an extremely good approximation for 
p(n); later Rademacher improved the approximation a little and turned it 
into an analytic expression for p(n). A weak form of the Hardy-Ramanujan 
result states that 

p(n) ~ meV, 
4y/3n 


and Hardy and Ramanujan also gave an elementary proof of 
log p(n) ~ m/2n/3. 
Two decades later, Erdés set about proving that elementary methods can go 
considerably further, and in 1942 he proved that 
a 
p(n) by ee 
for some positive constant a. 

Taking his cue from the Hardy-Ramanujan result mentioned a little 
earlier, that most integers n have about loglogn prime factors, Erdés also 
proved, with Lehner, that “almost all” partitions of a positive integer n 
contain about 


A(n) = * /3n/Blogn 
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summands. Furthermore, there is also a beautiful distribution about A(n): for 
x €R, the probability that a random partition of n has at most A(n) + a/n 
summands tends to 


ge ee. 

Some years later, in 1946, Erd6és returned to another variant of this problem. 
Given n € N, what is the most likely number of summands in a random 
partition of n? Writing ko(n) for this number, it is not clear that ko(n) is well- 
defined although, as was shown later by Szekeres, this is the case. However, 
what does seem to be clear is that k(n) is about A(n). Erdés proved that, 
in fact, 


Another circle of problems that has occupied Erdés for over 60 years 
originated with a question raised by Sidon when Erdés and Turan went to 
see him. Given a sequence S of natural numbers and k € N, write r,(n) for 
the number of representations of n in the form 


n=ay+a2+°':+4k, 


with a; € S and 1 < a, < ag <... < ag. Call S an asymptotic basis of 
order k if rx(n) > 1 whenever n is sufficiently large. In 1932 Sidon asked 
Erdos the following question. Is there an asymptotic basis of order 2 such 
that r2(n) = o(n*) for every « > 0? The young Erdés confidently reassured 
Sidon that he would come up with such a sequence. Erdés was right, but it 
took him over 20 years: he proved in 1954 in Acta (Szeged) that for some 
constant c there is a sequence S such that 


1<ro(n) < clogn 


if n is large enough. 

What can one say about r;,(n) rather than r2(n)? In 1990, Erdés and 
Tetali proved that for every k > 2 there are positive constants c,,co, and 
a sequence S such that c,logn < rz(n) < cologn if n is large enough. 
In fact, Erdos and Tetali gave two proofs of this theorem; the easier of 
the two gets the result as a fairly simple consequence of Janson’s powerful 
and ingenious correlation inequality. The related conjecture of Erdés and 
Turan, made in 1941, that if ro(n) > 1 for all sufficiently large n then 
limsup,_,.72(n) = ov, is still far from being solved, although it seems 
possible that much more is true, namely if ro(n) > 1 whenever n is large 
enough then lim sup,,_,,, r2(n)/logn > 0. 

In 1956, Erdés and Fuchs proved a remarkable theorem somewhat related 
to Sidon’s problem but originating in a result of Hardy and Landau. Let us 
write r(n) for the number of lattice points in Z? in the circle of radius \/n, 
so that r(n) is the number of integer solutions of the inequality x? + y? <n. 
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Gauss was the first to prove that r(n) stays rather close to its expectation, 
namely r(n) — mn = O(n'/?). In 1906, Sierpiriski returned to the study of 
r(n), and showed that, in fact, r(n) — (n) = O(n'/3). The question whether 
this bound is essentially best possible or could be improved: intrigued many 
of the best number theorists in the first few decades of this century: including 
Hardy, Littlewood, Landau and Walfisz. In 1925 Hardy and Landau gave an 
exact expression for r(n) — 7(n) is terms of Bessel functions. They showed 
also that r(n) does not stay too close to its expectation 7(n), namely that 


fim sup 70") = *(") 


0. 
noo (nlogn)1/4 = 


Erdés and Fuchs, proved that this result has nothing to do with the sequence 
of squares 07, 12,... but it holds in great generality. Indeed, let 0 < a; < ag < 
--- be any sequence of integers, and for n € N let r*(n) be the number of 
solutions of the inequality a; + a; <n. Then, as proved by Erdés and Fuchs 
for every positive real a we have 


|r*(n) = an| 


lim sup > 0. 


noo (nlogn)}/4 

Erdés contributed much to the theory of diophantine approximation. 

Recall that a sequence (dn) C [0,1] is said to be uniformly distributed if 
for allO <a< 6 <1 we have 


ie = So 1=f-a. (4) 


n—-oo 
K<N,A< bn. <B 


Weyl! proved in 1916 that (¢,) C [0,1] is uniformly distributed if, and only if, 


: I . 2Tikd; 
co dee 

for every non-zero integer k, Needless to say, this necessary and sufficient 
condition gives no information about the speed in (4). To get some 
information about the speed of convergence, one needs a “finite” form of 
Weyl’s criterion. A finite form was given by van der Corput and Koksma in 
1936, but a stronger conjecture of Koksma in his 1936 book on Diophantine 
approximation remained unproved until 1948, when Erdés and Turan proved 
the following remarkable theorem. 

Let ¢1,..-,¢n € [0,1], and set sy = )> j = 1%e?"**4s. Suppose that |s,| < 
w(k) for k = 1,...,m. Then for allO0 <a < 8 < 1 we have 


(B-ayn- SD ) € of +> ways} 


asd; <B 


for some absolute constant C. 


16 Béla Bollobas 


This result has had numerous applications, starting with the following 
beautiful theorem from the original Erdés-Turan paper. For n > 2, let 
f(z) =2” +0n,2" * +++ +a1zZ+a9 = [[¢ — 25) 
j=l 
be such that z;| > 1 for every j. For 0 < @ < 1 set Mg = maxj,)~» |f(z)| and 
define g(n, 0),2 < g(n,0) < n, by 


Mo/ ao] = €*/98), 
Then for all 0 < a@ < 6 < 27 we have 


n 


log g(n, 4)’ 


B-a 

moe S- i < C log(4/@) 
a<arg 24<6 

where C is an absolute constant. 

Note that if Mg/./|ao| is “not too large”, say at most eV”, then the error 
term above is O(n/logn). 

Other applications were found by Egervary and Turan, Kornyei, and 
others. When, in 1988, Laczkovich cracked Tarski’s 50-year old problem on 
squaring the circle, he made substantial use of this theorem of Erdds and 
Turan from 1948. 

There are very few people who have contributed more to the fundamental 
theorems in probability theory than Paul Erdds; here we shall state only 
a small fraction of the major results of Erdds in probability theory. The 
law of the iterated logarithm was proved around 1930 by Khintchine and 
Kolmogorov, with further contributions from Lévy. To state this fundamental 
result, let X ,X2,... be independent Bernoulli random variables, with 
P(X, = —-1) = P(X, = 1) = § for every n, and set S, = )>j_, Xi. The 
law of the iterated logarithm states that limsup,, ,,, Sn/W2nloglogn = 1 
almost surely. Putting it another way, for t € [0,1], let t = 0.e,(t)eo(t)... 
be its dyadic expansion, or equivalently, set €,,(¢) = 0 or 1 according as the 
integer part of 2"t is even or odd. (Thus the variables X,,(¢) = 2e,(t) — 1 are 
as above.) Set fn(t) = S0¢_, ex(t) — #. Then the law of iterated logarithm 
states that 


Fn(t) 
lims = 
nice (Blog logn)'2 
for almost every t € (0, 1]. 
Let ¢(n) be a monotone increasing non-negative function defined for all 
sufficiently large integers. Following Lévy, this function ¢(n) is said to belong 
to the upper class if, for almost all t, 


En(t) S O(n) 


provided n is sufficiently large, and it belongs to the lower class if, for almost 
all t, f(t) > o(n) for infinitely many values of n. Then the law of the iterated 
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logarithm states that d(n) = (1+e)(# log log n)!/? belongs to the upper class 
if € > 0, and to the lower class if € < 0. 

In 1942, Erdés considerably sharpened this assertion when he proved that 
a function 


1/2 
(ackex) {loglogn + 2logz;n + $logyn+... 
5 log,_1 n+ (5 +6) log, n} 


belongs to the upper class if « >0, and to the lower class if € >0. (We write 
log, for the & times iterated logarithm.) Not surprisingly, Erdés gave 
an elementary proof, and made no use of the results of Khintchine and 
Kolmogorov. Furthermore, as he indicated, the result could easily be extended 
to the case of Brownian motion. Some years later, Erdés returned to this topic 
in a joint paper with K.L. Chung. 

In addition to the papers that were instrumental in creating probabilistic 
number theory, Erdés wrote some important papers with Mark Kac proving 
several basic results of probability theory. Let X,, X2,...,Xn be independent 
random variables, each with mean 0 and expectation 1. As before, set 
Sh = bya X1,k =1,...,n. In 1946, Erdés and Kac determined the limiting 
distributions of maxi<g<n Sp and maxi<k<n |Sz|, which turned out to be 
independent of the distribution of the X;. 

Although this result was important, the method of proof was even more 
so: Erddés and Kac proved that if the theorem can be established for one 
particular sequence of independent random variables satisfying the conditions 
of the theorem, then the conclusion of the theorem holds for all sequences 
of independent random variables satisfying the conditions of the theorem. 
Erdos and Kac called this the invariance principle. Since then, this principle 
has been widely applied in probability theory. 

Erdés and Kac promptly proceeded to apply their powerful invariance 
principle to extending a beautiful result of Paul Lévy, proved in 1939. To 
state this result, let X , X2,... be independent random variables, each with 
mean 0 and variance 1, such that the central limit theorem holds for the 
sequence. As before, let S, = X; +--:+ Xx, and let N, be the number of 
S;, 1 < k <n, which are positive. Erdés and Kac proved in 1947 that, in 
this case, 

lim P(N,,/n < vas in a/? 

Jim n/? <x) = — arcsins 
for all 4,0 < x < 1. Thus N,,/n tends in distribution to the arc sin 
distribution. 

What Paul Lévy had proved in 1939 is that this arcsin law holds in the 
binomial case P(X; = 1) = P(X, = -1) = 1/2. 

In 1953 Erdos returned to this theme. In a joint paper with Hunt he 
proved that if X 1, X2,... are independent zero-mean random variables with 
the same continuous distribution which is symmetric about 0 then, almost 
surely, 
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In his joint papers with Dvoretzky, Kac and Kakutani, Erd6és contributed 
much to the theory of random walks and Brownian motion. For example, 
in 1940, Paul Lévy proved that almost all paths of a Brownian motion in 
the plane have double points. This was extended by Dvoretzky, Erdés and 
Kakutani in 1950: they proved that for n < 3 almost all paths of a Brownian 
motion in R” have double points, but for n > 4 almost all paths of a Brownian 
motion in R” are free of double points. In 1954, in a paper dedicated to Albert 
Einstein on his 75th birthday, Dvoretzky, Erdés and Kakutani returned to 
this topic, and proved that almost all paths of a Brownian motion in the 
plane have k-multiple points for every k, k = 2,3...; in fact, for almost all 
paths the set of k-multiple points is dense in the plane. 

Let us say a few words about classical measure theory. A subset of a 
metric space is of first category if it is a countable union of nowhere dense 
sets. There are a good many striking similarities between the class of nullsets 
and the class of sets of first category on the line. Indeed, both are o-ideals 
(i.e. o-rings closed under taking subsets), both include all countable sets and 
contain some sets of cardinality c, both classes have power 2°, both classes 
are invariant under translation, neither class contains an interval, in fact, the 
complement of any set of either class is a set dense in R, the complement of 
any set of either class contains a member of the class with cardinality c, and 
so on. 

Of course, neither class includes the other; also, it is easily seen that 
R = AUB, with A of first category and B a nullset. Nevertheless, the existence 
of numerous common properties suggests that the two o-ideals are similar in 
the sense that there is a one-to-one mapping f : R > R such that f(F) isa 
nullset if and only if F is of first category. In 1934 Sierpinski proved that this 
is indeed the case, provided we assume the continuum hypothesis. Sierpinski 
went on to ask whether the stronger assertion is also true that, assuming 
the continuum hypothesis, there is a function simultaneously mapping the 
two classes into each other. In 1943 Erdds answered this question in the 
affirmative. 

Assuming the continuum hypothesis, there is a one-to-one map f : RR 
such that f(£) is a nullset if and only if F is of first category, and f(£) is of 
first category if and only if F is a nullset. In fact, f can be chosen to satisfy 
f=fr'. 

Of the many results of Erdés in approximation theory, let us mention some 
beautiful theorems concerning Lagrange interpolation. Let X = {xi n},n = 
1,2,...,4=1,2,...,n, be a triangular matrix with 


—1< ain < fon <-++ << Inn <1 (5) 
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for every n. The values x;,, are the nodes of interpolation. As usual, for 
1<k<n, define the fundamental polynomials 1;,,(X) as 


n 
len (X) = [[@ _ eal [[@en _ Lez): 
if j= ij 
so that lyn (X) is the unique polynomial of degree n—1 with zeros at %jin,1 4 
k, with lin(Xk,n) = 
The Lebesgue functions and the Lebesgue constants of the interpolation 
are 
n 
An(v) = » Idkn(x)| and A» = _max | An(2). 
c=1 
In fact, one frequently considers a generalization of the Lebesgue constants 
as well: for -l1 <a<b< 1 set 
Akt 0) mane, An( 2); 
so that A, = An (—1,1). 
Faber showed before the First World War that if X is any set of nodes 
satisfying (5) then A, > z logn for every n. 
Much research was done on improving this inequality. After a series of 
papers with Turan, in 1942 Erdds proved the asymptotically best possible 
inequality that 


2 
An > —logn+ O(1) 
as 


for every matrix X. 
In 1931 Faber’s inequality was extended by Bernstein, who proved that 
there is an absolute constant c > 0 such that 


An(a, b) > clogn, 


provided —1<a<b<1, and n is sufficiently large, depending on (a,b). In 
other words, the L..-norm of the restriction of \,,(X) to the interval (a, b) 
grows at least as fast as clogn. 

In a beautiful paper, written jointly with Szabados, Erdés proved in 1978 
the much stronger result that a similar assertion holds for the normalized 
Ly-norms. 

There is an absolute constant c > 0 such that if X is an arbitrary system 
of nodes satisfying (5), -l1<a<b<1, and n is sufficiently large, then 


b 
/ An(x) dx > c(b— a) logn. 


In the special case a = —1, b = 1, the result had been announced by Erdés in 
1961, but the proof in the Erdés-Szabados paper in 1978 was along different 
lines and simpler. 
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Let us turn to a substantial extension of some classical results of Faber 
and Bernstein. Given a system X of nodes satisfying (5), and a function F’ 
n [—1, 1], let 


n(F, X, 2) =F Pada x) 


be the nth Lagrange interpolation polynomial of F. Thus L,(F, X,x) is the 
unique polynomial of degree at most n — 1 whose value at rpp, is F(Xpn), 1 < 
k <n. Extending a result of Faber from 1914, Bernstein proved in 1931 that, 
for every triangular matrix X satisfying (5), there is a continuous function 
F and a point 29, —1 < 2 < 1, such that 

lim sup |L,,(F, X, xo)| = 00. (6) 

noo 

In 1936, Géza Griinwald and Marcienkiewicz proved that if X is the “good” 
Chebyshev matrix then for some continuous function F' relation (6) holds for 
almost every 29, and 1978 Privalov proved the same assertion for the class 
of Jacobi matrices. 

After these results concerning special classes of matrices, in 1980 Erdés 
and Vértesi proved the striking result that a similar assertion holds for every 
matrix X satisfying (5): there is always a continuous function F such that (6) 
holds for almost every xo. The proof is intricate and ingenious. 

To conclude our brief list of results on approximation theory, let us return 
to an early major result of Erdds. It has been known since Newton that 
interpolation polynomials can be used to approximate definite integrals of 
functions. Indeed, as proved by Stieltjes, if the nth row of X consists of the 
roots of the nth Legendre polynomial then 

1 1 
lim In(B, X, x) dx = F(a) dx 
for every Riemann integrable function F. 

Later this result was extended to other matrices X formed by the zeros of 
polynomials that were orthogonal in [—1, 1] with respect to a weight function 
of the form (1 — x)*(1 +)° for some a and 8. However, this was not known 
for any general class of weight functions; furthermore, the result of Stieltjes 
could not even be sharpened to 


1 
lim |L,(F, X,x) — F(a)| da = 1. 


noo J_4 
In 1937, Erdés and Turan solved both problems. Let p(z) > M > 0 be 
Riemann integrable over [—1,1], and let wo(x),wi(zx),... be orthogonal 


polynomials in [—1,1] with respect to p(x), with w,(x) being a monic 
polynomial of degree n. Let A,,, By, be constants with B, <0 such that 


Ryr(x) = wy(@) + Anwn—1(2) + Bywn—2(z) 
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has n different roots in [—1,1], and let X be the set of nodes formed by the 
roots of the polynomials R,, R2,.... Erdos and Turan proved that in this case 
every Riemann integrable function F(x) on [—1, 1] satisfies 


1 
lim |L,,(F, X,x) — F(x)| dx = 0. 
noo J_4 

Much of Erdés’s work in real analysis concerns so-called Tauberian 
theorems. The origin of these results is a theorem of Tauber stating that 
if oa,a" > s as x > 1-, and na, — 0 as n > ov, then Day is 
convergent (to sum s). Hence if na, > 0 and S> a, is Cesdro summable then 
>> dy, is convergent. Soon after the turn of the century, Landau, Hardy and 
Littlewood founded a flourishing branch of analysis by making extensive use 
of deep results resembling this theorem of Tauber. These Tauberian theorems 
claim that if a series is summable with a certain method of summation and 
satisfies certain additional conditions then it is also summable with a weaker 
method of summation. For example, Hardy and Littlewood proved in 1911 
that if SS an is Borel summable (i.e. limy+oe7* >> 8nx"/n! exists, where 
Sn = a, +...+ an) and V/nan > 0 then Sian is convergent. The second 
part of the elementary proof of the PNT was, essentially, such a Tauberian 
theorem. 

Shortly after the elementary proofs of the Prime Number Theorem 
were found, Erdés proved that the PNT can be deduced from Selberg’s 
fundamental formula alone, without any reference to other properties of the 
sequence of primes. What Erdés needed was the following Tauberian theorem: 
if a, > 0 and 07, ak(Sn—-“ +k) =n? + O(n) then s, = n+ O(1). Here, as 
before, $s, = a, +---+4n. 

Hardy and Littlewood also considered lacunary series and proved, among 
others, that under certain lacunarity conditions Abel-summability implies 
summability. In 1943, Meyer-Konig proved a similar lacunarity theorem for 
Euler summability: if > an is Euler summable (i.e. limp+oo 27” psa eal Sk 
exists) and a, = 0 except ifn =n;, wheren, <ng <--- satisfies nj41/nj > 
c> 1, then > a, is convergent. Meyer-Konig went on to conjecture the much 
stronger assertion that instead of nj41/n; > c > 1 it suffices to demand that 
N41 — 4 > A,/n; for some A > 0. In 1952 Erdés came very close to proving 
this conjecture: he showed that the assertion is true if A > 0 is sufficiently 
large. 

Another Tauberian theorem of Erdés, proved with Feller and Pollard in 
1949, is important in the theory of Markov chains. Let po,pi,... be non- 
negative, with )> py = 1 and p = >> kpp, and suppose that P(z) = >>> ppz® 
is not a power series in z’ for any integer t > 1. Then |P(z)| < 1 for |z| < 1; 
in particular, (1 — P(z))~! is analytic in |z| < 1, say 


1 = ‘ 
— = UZ. 
1— P(z) » 
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The Erdés-Feller-Pollard Theorem states that limp... up = 1/pif uw < co and 
Up — oO if uw = oo. The theorem has important consequences in probability 
theory, and in 1951 de Bruijn and Erdés also used it to study recursion 
formulae. 

In a beautiful paper written with Niven in 1948, Erdés extended a result 
relating the zeros of a complex polynomial to the zeros of its derivative. 
Among other results, Erdés and Niven proved that if r1,1r2,..-,1%n are the 
zeros of a complex polynomial, and Ry, Ro,...,Rn—1 are the zeros of its 
derivative then 


1 n 1 n-1 
ey eee Rj 
j=l j=l 


for every z € C, with equality if, and only if, all the, zeros r; are on a half-line 
emanating from z. 

In a difficult paper written with Szeg6 in 1942, Erdés tackled a problem 
concerning real polynomials. Extending Markov’s classical theorem that if a 
polynomial f of degree n satisfies |f(x)| < 1 for -1 < x# <1, then |f’(a)| < n? 
for —1 < # < 1, Schur proved in 1919 that if f is a polynomial of degree n 
with |f(x)| <1 for -1 <a <1, then |f"(ao) < $n”, provided -1 < x <1 
and f” (ao) = 0. Writing m,, for the smallest constant that would do in the 
inequality above instead of $, Erdés and Szeg6 proved that for n > 3 the 
extremum m,n? is attained for zo = 1 (or —1) and the so-called Zolotarev 
polynomials. This enabled Erdos and Szeg6 to determine lim,_-,.5 Mn as well 
(which turned out to be 0.3124 ...). 

Whatever branch of mathematics Erdés works in, in spirit and attitude 
he is a combinatorialist: his strength is the hands-on approach, making use 
of ingenious elementary methods. Therefore it is not surprising that Erdés 
helped to shape twentieth-century combinatorics as no-one else: with his 
results, problems, and influence on people, much of combinatorics in this 
century owes its existence to Erdos. 

One of the fundamental results in combinatorics is a theorem (to be 
precise, a pair of theorems) proved by F.P. Ramsey in 1930. Erdés was the first 
to realize the tremendous importance of this “super pigeon-hole principle”, 
and did much to turn Ramsey’s finite theorem into Ramsey theory, a rich 
branch of combinatorics, as witnessed by the excellent monograph of Graham, 
Rothschild and Spencer. In a seminal paper written in 1935, Erdés and his 
co-author, George Szekeres, tackled the following beautiful problem of Esther 
Klein: can we find, for a given n, a number N(n) such that from any set of N 
points in the plane it is possible to select n points forming a convex polygon? 
Erdés and Szekeres showed that the existence of N(n) is an easy consequence 
of Ramsey’s theorem for finite sets. In fact, they discovered Ramsey’s theorem 
for themselves, and were told only later that they had been beaten to it by 
Ramsey. It is remarkable that Ramsey, working in Cambridge, and Erdés and 
Szekeres, working in Budapest, arrived at the same result independently and 
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in totally different ways, but within a few years of each other. As it happened, 
the proof given by Erd6és and Szekeres is much simpler than the original, and 
it also gives much better upper bounds for the various Ramsey numbers. In 
particular, they proved that if k,! > 2 then 


k+l—2 
< 
rats (*T'7?), 


where the Ramsey number R(k,1) is the smallest value of n for which every 
graph of order n contains either a complete graph of order & or | independent 
vertices. 

In view of the simplicity of the proof of the Erdds-Szekeres bound, it is 
amazing that over 50 years had to pass before the bound above was improved 
appreciably. In 1986 R6odl showed that there is a positive constant c > 0 such 
that 


R(k, 1) < (* oe *) poe iy, 


and, simultaneously and independently, Thomason replaced the power of the 
logarithm by a power of k + 1. To be precise, Thomason proved that 


R(k,) < ee ake ’) 
’ S a 


for some absolute constant A > 0 and all k,! with k > 1 > 2. 

Concerning the lower bounds for R(k,1/), especially R(k,k), the situation 
seems to be even more peculiar. It is not even obvious that R(k,k) is not 
bounded from above by a polynomial of k. Indeed, it was again Erdés, 
who gave, in 1947, the following lower bound: if @ieG < 1 then 
R(k,k) > n. Erdés’s proof is remarkable for its simplicity and its influence 
on combinatorics. Although there are very few mathematicians who do not 
know this proof, we present it here, since it is delightful and brief. Consider 
the set of all 2(2) graphs on {1,2,...,n}. What is the average number of 


complete subgraphs of order k? Since each of the (j) possible complete 


subgraphs of order k is contained in 2(3)-(2) of our graphs, the average is 
k 
(n)2-() < 1/2. Similarly, the average number of complete subgraphs of order 


k in the complements of our graphs is also (a5) < 1/2. Consequently, 
there is some graph G on {1,2,...,n} such that neither G nor its complement 
G contains a complete graph of order k. Hence R(k,k) > n, as claimed. 

It took over three decades to improve this wonderfully simple lower bound: 
in 1977 Spencer showed that an immediate consequence of the Erdés-Lovasz 
Local Lemma is that 


R(k, k) > k2*/? (2 + x) ; 
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which is only about a factor 2 improvement. Needless to say, the combina- 
torialists are eagerly awaiting a breakthrough that more or less eliminates 
the gap between the upper and lower bounds for R(k,k), but judging by the 
speed of improvements on the original bounds of Erdos, we are in for a long 
wait. 

Erdés did not fail to notice that the other theorem of Ramsey from 
1930, concerning infinite sets, also had a tremendous potential. In the 1950s 
and 1960s, mostly with his two great collaborators, Rado and Hajnal, Erdés 
revolutionized combinatorial set theory. 

Ramsey’s theorem concerning infinite sets, in its simplest form, states 
that if G is an infinite graph then either G or its complement G contains an 
infinite complete graph. While this is very elegant, in order to express more 
complicated results succinctly, it is convenient to rely on the Erdds-Rado 
arrow notation. Given cardinals r, a and b,, y € I, where I’ is an indexing 
set, the partition relation 


a+ (by) yer 
is said to hold if, given any partition User I,, of the set A(r) of all subsets of 
cardinality r of a set A with |A| =a, there isa y € I and a subset B, of A 
with |B,| = 6, such that BY C I,. The same notation is used to express the 
analogous assertion when some or all the symbols r, a and 6, denote order 
types rather than cardinalities. If [ is a small set then one tends to write out 
all the bys. 
Thus, in this notation, the infinite Ramsey theorem is that 


No — (No, No)” 


for every integer r, with r = 2 being the case of graphs. 

In 1933, Sierpinski proved that there is a graph of cardinality 28° which 
has neither an uncountable complete graph nor an uncountable independent 
set: 


2% (N1,N1)?, 


so the “natural” extension of Ramsey’s theorem is false. Sierpinski’s result 
says that one can partition the pairs of real numbers in such a way that every 
uncountable subset of R contains a pair from both classes. This partition 
somewhat resembles a Bernstein subset of R. 

If we are happy with one of the classes being merely countably infinite 
then Ramsey’s theorem extends to all cardinals. This was proved in 1941 by 
Dushnik and Miller for regular cardinals, and extended by Erdos to singular 
cardinals. Thus, 


K > (K, No)”. 


In the language of graphs, this means that if a graph on & vertices does 
not contain a complete subgraph on « vertices then it contains an infinite 
independent set. 
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The Erdés-Rado collaboration on partition problems started in 1949. 
One of their first results is an attractive assertion concerning Q, the set of 
rationals. If G is a graph on Q then either G or its complement G contains a 
complete graph whose vertex set is dense in an interval. Years later this was 
considerably extended by Galvin and Laver. 

After a good many somewhat ad hoc results, in 1956 Erdés and Rado 
gave the first systematic treatment of “arrow relations” for cardinals; in 
their fundamental paper, “A partition calculus in set theory”, they set out to 
establish a ‘calculus’ of partitions. Among many other results, they proved 
that if A > 2 and p> No are cardinals then 


AN SOR sn) 
but. 
Mw ((A+p)t, pt)’. 


In the special case \ = 2 and p = No, the last relation is precisely Sierpinski’s 
theorem. 

In proving their positive results, Erdés and Rado used so called “tree 
arguments”, arguments resembling the usual proof of Ramsey’s infinite 
theorem, but relying on sequences of transfinite length. Another important 
ingredient is a stepping-up lemma, enabling one to deduce arrow relations 
about larger cardinals from similar relations about smaller ones. Thus the 
trivial relation Xi + (Ni)x, implies that 


(28°) * — (i)Ro- 


In 1965, in a monumental paper “Partition relations for cardinal numbers” , 
running to over 100 pages, Erdés, Hajnal and Rado presented an almost 
complete theory of the partition relation above for cardinals, assuming the 
generalized continuum hypothesis. For years after its publication, its authors 
lovingly referred to their paper as GTP, the Giant Triple Paper. 

In fact, Erdés had taken an interest in extensions of Ramsey’s theorem 
for infinite sets well before the Dushnik-Miller result appeared. In 1934, in 
a letter to Rado, he asked whether if we split the countable subsets of a set 
A of cardinality a into two classes then there is an infinite subset B of A, 
all of whose countable subsets are in the same class. In the arrow notation, 
does a — (No, Xo)*° hold for some cardinal a? Almost by return mail, Rado 
sent Erdés his counterexample (which is, by now, well known), constructed 
with the aid of the axiom of choice. Later this question led to the study of 
partitions restricted in some way, including the study of Borel and analytic 
partitions, and to many beautiful results of Galvin, Mathias, Prikry, Silver, 
and others. 

The first results concerning partition relations for ordinals were also 
obtained in 1954. In November 1954, on his way to Israel, Erdés passed 
through Ziirich. He told his good friend Specker that he was offering $20 for 
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a proof or disproof of the conjecture of his with Rado that w? > (w?,n)?. 
Within a few days, Specker sent Erdés a proof which is, by now, well 
known. Erdos had high hopes of building on Specker’s proof to deduce that 
w” — (w”, 3)? for every integer n > 3, but could prove only w2” > (w"t1, 4)?, 
A little later Specker produced an example showing that 


wo” fy (w", 8)? 


for every integer n > 3. 
Neither Specker’s proof, nor his (counter)example worked for 


ti Ss (es, 8): 
Erdos rated this problem so highly that eventually, in the late 1960s, he 
offered $250 for a proof or counterexample. The prize was won by Chang in 
1969 with a very complicated proof, which was later simplified by Milner and 
Jean Larson. 


The remaining problems are far from being easy, and Erdés is now offering 
$1,000 for a complete characterization of the values of a and n for which 


i Sela? ny? 


holds. 

The theory of partition relations for ordinals took off after Cohen 
introduced forcing methods and Jensen created his theory of the constructible 
universe. Not surprisingly, in many questions “independence reared its ugly 
head”, as Erdos likes to say. In addition to Erdds, Hajnal and Rado, a 
host of excellent people working on combinatorial set theory contributed 
to the growth of the field, including Baumgartner, Galvin, Larson, Laver, 
Maté, Milner, Prikry and Shelah. An account of most results up to the 
early 1980s can be found in the excellent monograph “Combinatorial Set 
Theory: Partition Relations for Cardinals” by Erdos, Hajnal, Maté and Rado, 
published in 1984. 

In 1940 Turan proved a beautiful result concerning graphs, vaguely related 
to Ramsey’s theorem. For 3 <r <n, every graph of order n that has more 
edges than an (r — 1)-partite graph of order n contains a complete graph 
of order r. It was once again Erdés who, with Turan, Gallai and others, 
showed that Turan’s theorem is just the starting point of a large and lively 
branch of combinatorics, extremal graph theory. In order to formulate the 
quintessential problem of extremal graph theory, let us recall some notation. 
As usual, we write |G| for the order (i.e. number of vertices) and e(G) for 
the size (i.e. number of edges) of a graph G. Given graphs G and H, the 
expression H C G means that H is a subgraph of G. Let F be a fixed graph, 
usually called the forbidden graph. Set 


ex(n; F) = max{e(G) : |G] =n and F ¢ G}. 
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and 
EX(n; F) ={G: |G| =n,e(G) = ex(n; F), and F ¢ G}. 


We call ex(n; F’) the extremal function, and EX(n;F) the set of extremal 
graphs for the forbidden graph F’. Then the basic problem of extremal graph 
theory is to determine, or at least estimate, ex(n;F') for a given graph 
F and, at best, to determine EX(n;F). From here it is but a short step 
to the problem of excluding several forbidden graphs, i.e. to the functions 
ex(n; Fi,...,F,) and EX(n,Fi,...,F,) for a finite family Fi,...,F, of 
forbidden graphs. 

Writing K,. for the complete graph of order r, and T;(n) for the unique 
k-partite graph of order n and maximal size (so that T;,(n) is the k-partite 
Turdn graph of order n), Turan proved, in fact, that EX(n; K,) = {T--1(n)}, 
ie. T,—1(n) is the unique extremal graph, and so ex(n; K") = t,_1(n), where 
tr_1(n) = e(T;,,(n)) is the size of T,_1(n). 

As it happens, Erdés came very close to founding extremal graph theory 
before Turan proved his theorem: in 1938, in connection with sequences of 
integers no one of which divided the product of two others, proved that for 
a quadrilateral C4 we have ex(n;C*) = O(n3/?). However, at the time Erdés 
failed to see the significance of problems of this type: one of the very few 
occasions when Erdoés was “blind”. 

Before we mention some of the important results of Erdés in extremal 
graph theory, let us remark that in 1970 (!) Erdés proved the following 
beautiful extension of Turdn’s theorem (so the rest of the world had been 
blind). Let G be a graph without a K;,, with degree sequence (d;){. Then 
there is an (r — 1)-partite graph G* (which, a fortiori, contains no K, either) 
with degree sequence (d*)', such that d; < d* for every 7. In this theorem, 
the achievement is in the audacity of stating the result: once it is stated, the 
proof follows easily. 

Erd6és conjectured another extension of Turan’s theorem which was 
proved in 1981 by Bollobés and Thomason. The conjecture was sharpened 
by Bondy. Let |G| =n and e(G) > t,_1(n). Then every vertex x of maximal 
degree d in G is such that the neighbours span a subgraph with more than 
ty—2(d) edges. In this instance it is also true that once the full assertion has 
been made, the proof is just about trivial; in fact, it is simply a minor variant 
of the proof of the previous theorem of Erdés. 

It is fitting that the fundamental theorem of extremal graph theory is a 
result of Erdés, and his collaborator, Stone. Note that, by Turan’s theorem, 
the maximal size of a K,-free graph of order n is about r=? (5), in fact, 


2 
trivially, 
r—-2/(n r—2n? 
<t,(n) < = 
r—1\2 r—12 


Writing, as usual, K,(t) for the complete r-partite graph with t vertices in 
each class, Erdés and Stone proved in 1946 that if r > 2,t > 1 ande> 0 are 
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fixed and n is sufficiently large then every graph of order n and size at least 


2 more edges than 


a 3) (5) contains a K,(t). In other words, even en 


can be found in a Turan graph guarantee not only a K, but a “thick” K,, 
one in which every vertex has been replaced by a group of t vertices. 
Prophetically, Erdés and Stone entitled their paper “On the structure 
of linear graphs”; this is indeed the significance of the paper: it not only 
gives us much information about the size of extremal graphs, but it is also 
the starting point for the study of the structure of extremal graphs. If F’ 
is a non-empty r-chromatic graph, ie. y(F) = r > 2, then, precisely by 
the definition of the chromatic number, F is not a subgraph of T;._;(n), so 
ex(n;F) > tr-i(n) > %=2(3). On the other hand, F Cc K,(t) if t is large 


enough (say, t > |F'|), so if « > 0 and n is large enough then 


ex(n, F) < — +e) Gh: 


In particular, if y(F') =r > 2 then 


lim ev(n.F)I (3) ae 


n—- oo 2 r—1’ 


that is the asymptotic density of the extremal graphs with forbidden subgraph 
F'(r — 2)/(r — 1). Needless to say, the same argument can be applied to the 
problem of forbidding any finite family of graphs: given graphs F), Fo,..., Fx, 
with min y(F;) =r > 2, we have 


lim ea(n, Fi,... ADI(3) a = 
noo 2 r—1 

Starting in 1966, in a series of important papers Erdés and Simonovits went 
considerably further than noticing this instant consequence of the Erdés- 
Stone theorem. Among other results, Erdés and Simonovits proved that if 
G € EX(n;F), with y(F) =r > 2, then G can be obtained from T,_\(n) by 
the addition and deletion of o(n?) edges. Later this was refined to several 
results concerning the structure of extremal graphs. Here is an example, 
showing how very close to a Turan graph an extremal graph has to be. 
Let F,,...,F, be fixed graphs, with r = min y(F;), and suppose that F, has 
an r-colouring in which one of the colour classes contains t vertices. Let 
G, € EX(n;F,...,F%). Then, asn > ov, 

(i) The minimal degree of Gy, is ((r — 2)/(r — 1) + o(1))n, 

(ii) The vertices of G,, can be partitioned into r —1 classes such that each 
vertex is joined to at most as many vertices in its own class as in any 
other class, 

(iti) For every « > 0 there are at most c. = c(e; F\,..., Fy) vertices joined to 
at least en vertices in their own class, 

(iv) There are 0(n?—1/*) edges joining vertices in the same class, 

(v) Each class has n/(r — 1) + O(n'—1/*) vertices. 
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Returning to the Erdés-Stone theorem itself, let us remark that Erdés 
and Stone also gave a bound for the speed of growth of ¢ for which K,(t) 
is guaranteed to be a subgraph of every graph with n vertices and at least 
((r — 2)/(r — 1) + €)($) edges. 

Let us write t(n,r,€) for the maximal value of t that will do. Erdés and 
Stone proved that t(n,r,¢) > (log,—,(n))!~® for fixed r > 2, « > 0 and 
5 > 0, and large enough n, where log;,(n) is the k times iterated logarithm 
of n. They also thought it plausible though unproved that log,_j(n) would 
be about the “best” value. 

This assertion was conjectured in several subsequent papers by Erdés, so 
it was rather surprising when, in 1973, Erdés and Bollobas proved that, for 
fixed r > 2 and 0 < « < I/(r —1) the correct order of t(n,r,€) is, in fact, 
logn. 

A little later, in 1976, Erdés, Bollobés and Simonovits sharpened this 
result, and the dependence of the implicit constant on r and e€ was finally 
settled by Chvatal and Szemerédi, who proved that there are positive absolute 
constants cy and cz such that 

logn logn 
Vog(tfe) = M9 S ogo) 
whenever r > 2 and0<e<1/(r—-1). 

For a bipartite graph F’, the general Erddés-Stone theorem is not sensitive 
enough to provide non-trivial information about ex(n; F), since all it tells us 
is that ex(n;F) = o(n?). It was, once again, Erdés, who proved several of 
the fundamental results about ex(n; F’) when F is bipartite. In particular, he 
proved with Gallai in 1959 that for a path P; of length / we have 


-1 
ex(n; pi) < =a i 


By taking vertex-disjoint unions of complete graphs of order 1, we see that 
this inequality is, in fact, an equality whenever I|n. The determination of 
ex(n; P,) was completed by Faudree and Schelp in 1973. 

Another ground-breaking result of Erdés concerns supersaturated graphs, 
i.e. graphs with slightly more edges than the extremal graph. An unpublished 
result of Rademacher from 1941 claims that a graph of order n with more 
than |n?/4| = t2(n) edges contains not only one triangle but at least |n/2| 
triangles. In 1962 Erdés extended this result considerably; he showed that for 
some constant c > 0 every graph with n vertices and |n?/4| + k edges has 
at least k|n/2| triangles, provided 0 < k < cn. Later, this led to a spate of 
related results by Erdés himself, Moon and Moser, Lovasz and Simonovits, 
Bollobas and others. 

Erdés, the problem-poser par excellence, could not fail to notice how 
much potential there is in combining Ramsey-type problems with Turan-type 
problems. 
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The extremal graph for K,, namely the Turan graph T;._1(n), is stable in 
the sense that if a K,-free graph G on n vertices has almost as many edges 
as T,-1(n), then G is rather similar to T,—1(n); in particular, it has a large 
independent set. Putting it another way, if G is K,-free and does not have a 
large independent set then e(G) is much smaller than t,_;(n). 

This observation led Erdés and Sds to the prototype of Ramsey-Turdn 
problems. Given a graph H and a natural number J, let f(n;H,1) be the 
smallest integer m for which every graph of order n and size more than m 
either contains H as a subgraph, or has at least | independent vertices. 

Erdés and Sés were especially interested in the case H = K, and 1 = o(n), 
and so in the function 

ery n 
Hr) = him tim f(s By Len))/($). 
It is easily seen that /(3) = 0, and in 1969 Erdés and Sds proved that I(r) = 
(r — 3)/(r — 1) whenever r > 3 is odd. 

The stumbling block in determining I(r) for even values of r was the 
case r = 4. Szemerédi proved in 1972 that f(4) < 1/4, but it seemed likely 
that f(4) is, in fact, 0. Thus it was somewhat of a surprise when in 1976 
Erdos and Bollobas constructed a graph on a k-dimensional sphere that shows 
f(4) = 1/4. In fact, this graph is rather useful in a number of other questions 
as well; it would be desirable to construct an infinite family of graphs in this 
vein. 

Erdés, Hajnal, Sés and Szemerédi completed the determination of I(r) in 
1983 when they showed that I(r) = (8r — 10)/(3r — 4) whenever r > 4 is 
even. Note that the condition that our graph does not have more than o(n) 
independent vertices, does force the graph to have considerably fewer edges: 
Turan’s theorem tells us that without the condition on the independence 
number the limit would be (r — 2)/(r — 1). 

Erdos was still a young undergraduate, when he became interested in 
extremal problems concerning set systems. It all started with his fascination 
with Sperner’s theorem on the maximal number of subsets of a finite set with 
no subset contained in another. Sperner proved in 1928 that if the ground 
set has n elements then the maximum is attained by the system of all |n/2|- 
subsets. Erdés was quick to appreciate the beauty and importance of this 
result, and throughout his career frequently returned to problems in this 
vein. 

In 1939, Littlewood and Offord gave estimates of the number of real 
roots of a random polynomial of degree n for various probability spaces of 
polynomials. In the course of their work, they proved that for some constant 


c> 0, if 2, 22,..., 2, are complex numbers with |z;| > 1 for each 7, then of 
the 2” sums of the form +2] + z2+...+ 2, no more than 
cr2" (log n)n~1/? (7) 


fall into a circle of radius r. 
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On seeing the result, Erdés noticed immediately the connection with 
Sperner’s theorem, especially in the real case. In fact, Sperner’s theorem 
implies the following best possible assertion. If x,,...,2%,, are real numbers of 
modulus at least 1, then no more than le 70) of the sums +2, +X ae Ly 


fall in an open interval of length 2. From here it was but a short ip to aghag 
that the maximal number of sums that can fall in an open interval of length 
2r is precisely the sum of the r largest binomial coefficients Gap 

Concerning the complex case, Erdés improved the Littlewood-Offord 
bound (7), to an essentially best possible bound, by removing the factor 
logn. More importantly, Erdés conjectured that the Sperner-type bound 
holds not only for real numbers, as he noticed, by for vectors of norm at 
least 1 in a Hilbert space. This beautiful conjecture was proved 20 years 
later by Kleitman and, independently, by Katona. In 1970, Kleitman gave a 


strikingly elegant proof of the even stronger assertion that if 71, 272,...,@n 
are vectors of norm at least 1 in a normal space, then there are at most C >) 
sums of the form +7; + vg +...+ 2, such that any two of them are at a 


distance less than 2. 

With Offord, in 1956 Erddés tackled the original Littlewood-Offord 
problem concerning random polynomials. They concentrated on the class 
of 2” polynomials of the form f,(x) = +a” + 2" 1. Refining the 
result of Littlewood and Offord, they proved that, with the exception of 

o((log log n)~!/3)2" polynomials, the equations f,,(« Ve = 0 have 


2 
= log n + o((log n)?/ log log n) 


real roots. 

Let us turn to some results concerning hypergraphs, the objects most 
frequently studied in the extremal theory of set systems. For a positive integer 
r, an r-uniform hypergraph, also called an r-graph or r-uniform set system, 
is a pair (X,A), where X is a set and J is a subset of X‘), the set of all 
r-subsets of X. The vertex set of this hypergraph is X, and A is the set of 
(hyper) edges. The vertex set is frequently taken to be [n] = {1,...,n}, and 
our hypergraph is often referred to as a “collection of r-subsets of [n]”. For 
r = 2 an r-graph is just a graph. Although r-graphs seem to be innocuous 
generalizations of graphs, they are much more mysterious than graphs. 

The most influential paper of Erdd6s on hypergraphs, “Intersection 
theorems for systems of finite sets” , written jointly with Chao Ko and Richard 
Rado, has a rather curious history. The research that the paper reports on 
was done in 1938 in England. However, at the time there was rather little 
interest in pure combinatorics, and the authors went their different ways: 
Erdés went to Princeton, Chao Ko returned to China, and Rado stayed in 
England. As a result of this, the paper was published only in 1961. 

In its simplest form, the celebrated Erdés-Ko-Rado theorem states the 
following. Let A C [n]”, that is let A be a collection of r-subsets of the set 
[In] = {1,2,...,n}. Ifn > 2r and A is intersecting, that is if ANB # 


32 Béla Bollobas 


whenever A,B € A, then |A| < (7}). Taking A = {A € [n] : 1 € A}, we 
see that the bound is best possible. This result has been the starting point 
of much research in combinatorics. By now there are a good many proofs of 
it, including a particularly ingenious and elegant proof found by Katona in 
1972. 

The more general Erdés-Ko-Rado theorem states that if1<t<r, AC 
[n]) and A is t-intersecting, that is if |AN B| >t whenever A,B € A, then 


|A| < coma provided n is large enough, depending on r and t. 


In the original paper it was proved that n >t+(r-—t) ae will do. Once 
again, the bound on |.A| is best possible, as shown by a collection of r-subsets 
containing a fixed t-set. But the bound on n given by Erdés, Ko and Rado 
is far from being best possible. For 7 = 0,1,...,r —t, let 


A; = {Ae [n]% : |An [e+ 2i]| > t+ 4}. 


It is clear that each A; is a ¢-intersecting system, and it so happens that 
|Ai| > |Ao| if n < (£+1)(r —t +1). Thus the best we can hope for is that 
the Erdés-Ko-Rado bound ("7/) holds whenever n > (t + 1)(r —t +1). 

It took many years to prove that this is indeed the case. In 1976 Frankl 
came very close to proving it: he showed it for all t except the first few values, 
namely for allt > 15. Finally, by ingenious arguments involving vector spaces, 
Richard Wilson gave a complete (and self-contained) proof of it in 1984. 

The Erd6és-Ko-Rado theorem inspired so much research that in 1983 Deza 
and Frankl considered it appropriate to write a paper entitled “The Erdés- 
Ko-Rado theorem — 22 years later”. 

The first Erdos-Rado paper that appeared in print, in 1950, contained 
their canonical Ramsey theorem for r-graphs, to be precise, for N‘"), the 
collection of r-subsets of N. This is yet another Erd6és paper which had much 
influence on the development of Ramsey theory, especially through the work 
of Graham, Leeb, Rothschild, Spencer, Nesetril, Rodl, Deuber, Voigt and 
Promel. To formulate this result, let X C N or, for that matter, let X be any 
ordered set, and let r be an integer. A partition of X‘” into some classes 
(finitely or infinitely many) is said to be canonical if there is a set I C [r] 
such that two r-sets A = (a1,...,ar),B = (b1,...,b») € X) belong to the 
same class if, and only if, a; = b; for every 7 € I. Here we assumed that 
ay <... <a, and 6; <... < b,. Thus in a canonical partition, A and B 
belong to the same class if, and only, for each i € I, the ith element of A is 
identical with the ith element of B. 

Note that if all we care about is whether two r-sets belong to the same 
class or not, then for every ordered set X with more than r elements, X ”) 
has precisely 2” distinct canonical partitions, one for each subset I of [r]. If 
X is infinite then there is only one canonical partition with finitely many 
classes: this is the canonical partition belonging to I = 9, in which all r-sets 
belong to the same class. 
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The Erdés-Rado canonical Ramsey theorem claims that if we partition 
N®) into any number of classes then there is always an infinite sequence 
of integers x1 < £2 < ... on which the partition is canonical. If N is 
partitioned into only finitely many classes then, as it was just remarked, on 
X = {x1,2%2,...} the canonical distribution belongs to J = Q, that is all r-sets 
of X belong to the same class. Thus Ramsey’s theorem for infinite sets is an 
instant consequence of the Erdés-Rado result. 

The canonical Ramsey theorem has attracted much attention: it has been 
extended to other settings many times over, notably by Erdés, Rado, Galvin, 
Taylor, Deuber, Graham, Voigt, Nesetril and Rédl. 

In 1952 Erdés and Rado gave a rather good upper bound for the Ramsey 
number R()(n,...,n)x = R“(n;k) concerning (r)-graphs: R\)(n;k) is the 
minimal integer m such that if [m]‘”) is partitioned into k classes then there 
is always a subset N € [m]™ all of whose r-sets are in the same class. 
Putting it slightly differently, R(n;k) is the minimal integer m such that 
every k-colouring of the edges of a complete r-graph of order m contains a 
monochromatic complete r-graph of order n. Writing exp, & for the s times 
iterated exponential so that exp, k = k,expy k = k*, and exp; k = Kee Erdés 
and Rado proved that 


R™ (n; k)/" < exp,_, k. 


Although this seems a rather generous bound, in their GTP, Erdés, Hajnal 
and Rado proved that for r > 3 we have 


RO (nk)! > exp,_g k. 


Erdos believes that the right order is given by the upper bound. 

An important question concerning hypergraphs is to what extent the 
Erdés-Stone theorem can be carried over to them. The density d(G) of an 
r-graph G = (X, A) of order n is 


a(@) = la/(”), 


so that 0 < d(G) < 1 for every hypergraph. Call 0 < a < 1 a jump-value for 
r-graphs if there is a 6 = 8,(a@) > a such that for every a’ > a and positive 
integer m there is an integer n such that every r-graph of order at least n 
and density at least a’ contains a subgraph of order at least m and density at 
least 6. An immediate consequence of the Erdés-Stone theorem is that every 
qa in the range 0 < a < 1 is a jump-value. 

In 1965 Erdés proved that, for every r > 1, 0 is a jump-value for r-graphs 
and, in fact, 8,(0) = r!/r” will do. This is because if a’ > 0, m > 1 and n is 
sufficiently large then every r-graph of order at least n and density at least 
a’ contains a Ki” (m), a complete r-partite r-graph with m vertices in each 
class. Clearly, 
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atc (ony) =e /(™) ~ F 
This seems to indicate that every a, 0 < a < 1, is a jump-value for r-graphs 
for every r > 3 as well. Nevertheless, for years no progress was made with the 
problem so that, eventually, Erdés was tempted to offer $1,000 for a proof 
or disproof of this assertion. In 1984, Frankl and Rédl won the coveted prize 
when they showed that 1 —1~‘"-)) is not a jump-value for r-graphs if r > 3 
and | > 2r. In spite of this beautiful result, we are very far from a complete 
characterization of jump-values. 

The important topic of A-systems was also initiated by Erdés. A family 
of sets {A,},er is called a A-system if any two sets have precisely the same 
intersection, that is if the intersection of any two of them is (] yer Ay: Given 
cardinals n and p, let f(n;p) be the maximal cardinal m for which every 
collection of m sets, each of size (at most) n, contains a A-system of size p. 
In 1960, Erdés and Rado determined f(n;p) for infinite cardinals but found 
that surprising difficulties arise when n and p are finite. Even the case p = 3 
seems very difficult, so that they could not resolve their conjecture that 


f(n;3) < e" (8) 


for some constant c. 

As Erdés and Rado pointed out, it is rather trivial that f(n;3) > 2”. 
Indeed, let A be the collection of n-subsets of a 2n-set {21,...,%n,Y1,---;Yn} 
containing precisely one of x; and y; for each i. Then |.A| = 2” and A does 
not contain a A-system on three sets. 

Abbott and Hanson have improved this bound to f(n;3) > 10"/?, but 
due to the very slow progress with the upper bound, for years now Erddés 
has offered $1,000 for a proof or disproof of (8). Recently, Kostochka has 
made some progress with the problem when he proved that f(n;3) < 

—=Tb 

Let us turn to a flourishing area of mathematics that was practically 
created by Erdos. This is the theory of random graphs, started by Erdés and 
then, a little later, founded by Erdés and Rényi, 

Throughout his career, Erdés had a keen eye for problems likely to yield 
to either combinatorial or probabilistic attacks. Thus it is not surprising that 
he had such a tremendous success in combining combinatorics and probability. 

At first, Erdés used random methods to tackle problems in main- 
stream graph theory. We have already mentioned the delightful probabilistic 
argument Erdos used in 1947 to give a lower bound for the Ramsey number 
R(k,k). A little later, in 1959, a more difficult result was proved by Erdés 
by random methods: for every k > 3 and g > 3 there is a graph of 
chromatic number k and girth g. Earlier results in this vein had been proved 
by Tutte, Zykov, Kelly and Mycielski, but before this beautiful result of 
Erdés, it had not even been known that such graphs exist for any k > 6. 
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Later ingenious constructions were given by Lovasz, NeSetiil and Rodl, but 
these constructions lead to considerably larger graphs than obtained by 
Erdés. 

In a companion paper, published in 1961, Erdés turned to lower bounds 
for the Ramsey numbers R(3,/), and proved by similar probabilistic argu- 
ments that R(3,1) > col?/(logl)? for some positive constant co. In 1968, 
Graver and Yackel gave a good upper bound for R(3,1), which was improved, 
in 1972, by Yackel. As expected, further improvements were harder to come 
by. In 1980, Ajtai, Komldés and Szemerédi proved that R(3,1) < c,l?/log 1; the 
difficult proof was simplified a little later by Shearer. Very recently, J.B. Kim 
improved greatly the lower bound due to Erdés, and so now we know that 
the order of R(3,1) is 1?/logl. 

Almost simultaneously with his beautiful applications of random graphs 
to extremal problems, Erdés, with Rényi, embarked on a systematic study of 
random graphs. The first Erddés-Rényi paper on random graphs, in 1959, 
is about the connectedness of Gp, the random graph with vertex set 
[n] = {1,2,...,n}, with M randomly chosen edges. Extending an unpublished 
result of Erdés and Whitney, they proved, among others, that if c € R and 
M = M(n) = [$n(logn +c)| then 


lim P(G,,m is connected) = ee 
noo 


This implies, in particular, that if M = M(n) = |$n(logn + w(n))| then 


0 if w(n) > -w, 
lim (Gna is connected) = e° if w(n)>ceER, 
1 if w(n) > co. 


The result is easy to remember if one notes that the “obstruction” to 
the connectedness of a random graph is the existence of isolated vertices: if 
|w(n)| is not too large, say at most loglogn, then Gp, is very likely to be 
connected if it has no isolated vertices (and if it does have isolated vertices 
then, a fortiori, it is disconnected). 

By now, quite rightly, this is viewed as a rather simple result, but when 
it was proved, it was very surprising. To appreciate it, note that a graph of 
order n with as few as n — 1 edges need not be disconnected, and a graph of 
order n with as many as (n — 1)(n — 2)/2 edges need not be connected. 

A little later, in 1960, in a monumental paper, entitled “On the evolution 
of random graphs”, Erdés and Rényi laid the foundation of the theory of 
random graphs. As earlier, they studied the random graphs Gp,ay with n 
labelled vertices and M random edges for large values of n, as M increased 
from 0 to Hil) They introduced basic concepts like “threshold function”, 
“sharp threshold function”, “typical graph”, “almost every graph”, and so 
on. An important message of the paper was that most monotone properties 
of graphs appear rather suddenly. A property Q,, of graphs of order n is said 
to be monotone increasing if Qn is closed under the addition of edges. Thus 
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being connected, containing a triangle or having diameter at most five are 
all monotone increasing properties. Erdés and Rényi showed that for many a 
fundamental structural monotone increasing property Q,, there is a threshold 
function, that is a function M*(n) such that 


0 if M(n)/M*(n) > 0, 
1 if M(n)/M*(n) > oo. 


n—->co 


lim P(Gr,w has Qn) = 


Later it was noticed by Bollobés and Thomason that in this weak sense 
every monotone increasing property of set systems has a threshold function; 
recently a considerably deeper result has been proved by Friedgut and Kalai, 
which takes into account the automorphism group of the property, and so is 
much more relevant to properties of graphs. 

The more technical part of the “Evolution” paper concerns cycles, trees, 
the number of components and, most importantly, the emergence of the giant 
component. Erdés and Rényi showed that if M(n) = |cn| for some constant 
c > 0 then, with probability tending to 1, the largest component of G’,,y¢ is 
of order logn if c < s, it jumps to order W/ ife= 4 and it jumps again, 
this time right up to order n if c > 5. Quite understandably, Erdds and 
Rényi considered this “double jump” to be one of the most striking features 
of random graphs. 

By now, all this is well known, but in 1960 this was a striking discovery 
indeed. In fact, for over two decades not much was added to our knowledge of 
this phase transition or, as called by many a combinatorialist, the emergence 
of the giant component. The investigations were reopened in 1984 by the 
author of these lines with the main aim of deciding what happens around 
M = |n/2]; in particular, what scaling, what magnification we should use to 
see the giant component growing continuously. It was shown, among others, 
that if M = n/2+s and s = o(n) but slightly larger than n?/* then, with 
probability tending to 1, there is a unique largest component, with about 4s 
vertices, and the second largest component has no more than (log n)n?/s? 
vertices. Thus, in a rather large range, on average every new edge adds four 
new vertices to the giant component! 

With this renewed attack on the phase transition the floodgates opened, 
and quite a few more precise studies of the behaviour of the components near 
the point of phase transition were published, notably by Stepanov (1988), 
Flajolet, Knuth and Pittel (1989), Luczak (1990, 1991) and others. To cap it 
all, in 1993 Knuth, Pittel, Janson and Luczak published a truly prodigious 
(over 120 pages) study, “The birth of the giant component”, giving very de- 
tailed information about the random graph G’, yy near to its phase transition. 

Erdés and Rényi also stated several problems concerning random graphs, 
thereby influencing the development of the subject. In 1966, they themselves 
solved the problem of 1-factors: if n is even and M = M(n) = | $(logn+c)| 


then the probability that G,,,7 has a 1-factor tends to e-©  asn— oo; the 
“obstruction” is, once again, the existence of isolated vertices. 
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The Hamilton cycle problem was a much harder nut to crack. As a 
Hamiltonian graph is connected (and has minimal degree 2), it is rather 
trivial that if, with probability tending to 1, G,,,.17 has a Hamilton cycle and 
if M = M(n) is written as 


M =M(n)= 5 (logn + log logn + w(n)), 


then we must have w(n) + oo. On the other hand, it is far from obvious that a 
“typical” Gn, is Hamiltonian, even if M = |cnlogn| for some large constant 
c. This beautiful assertion was proved in 1976 by Pdésa, making use of his 
celebrated lemma. Several more years passed, before Komlés and Szemerédi 
proved in 1983 that w(n) — oo also suffices to ensure that a “typical” Gaz is 
Hamiltonian. A little later Bollobads proved a sharper, hitting time type result 
that had been conjectured by Erd6s and Spencer, connecting Hamiltonicity 
with having minimal degree at least 2. 

The chromatic number problem from the 1960 “Evolution” paper of Erdés 
and Rényi was the last to fall. In 1988 the author of this note proved that 
picking one of the 2(2) graphs on [n] at random, with probability tending to 
1, the chromatic number of the random graph is asymptotic to 1082» / log n. 
Earlier results had been obtained by Grimmett and McDiarmid, Bollobas and 
Erdés, Matula, Shamir and Spencer, and others, and subsequent refinements 
were proved by Frieze, Luczak, McDiarmid and others. 

The tremendous success of the theory of random graphs in shedding 
light on a variety of combinatorial, structural problems concerning graphs 
foreshadows the use of random methods in other branches of mathematics. 
Graphs carry only a minimal structure so they are bound to yield to detailed 
statistical analysis. However, as we acquire more expertise in applying 
results of probability theory, we should be able to subject more complicated 
structures to statistical analysis. In keeping with this philosophy, having 
founded, with Rényi, the theory of random graphs, Erdds turned to “the 
theory of random groups” with another great collaborator, Paul Turan. In a 
series of seven substantial papers, published between 1965 and 1972, Erdés 
and Turan laid the foundations of statistical group theory. 

For simplicity, let us consider the symmetric group S,,, and let 7, be a 
random element of S,,, with each of the n! possibilities equally likely. Thus 
Tn ts a random permutation of [n| = {1,2,...,n}, and every function of 7, 
is a random variable. One of the simplest of these random variables is the 
order O(m,) of a permutation 7p. 

Concerning g(n) = max,zes, O(7), the maximal order of a permutation, 
it was already shown by Landau in 1909 that 


Thus O(7,,) is always small compared to the order of the group S,,, although 
it can be rather large! 
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In contrast, for a single cycle of length n has order n, although such cycles 
constitute a non-negligible fraction, namely a fraction 1/n, of all possible 
permutations. What is then the order of most elements of S,,? 

As the starting point of their investigations, Erdés and Turan proved 
that for a “typical” permutation 7,, the order O(7) is much smaller than 
the maximum g(n) = exp{(nlogn)!/?(1 + o(1))}, and much larger than n. In 
fact, if w(n) > oo (arbitrarily slowly, as always) then 


1 
lim P(|log O(n) — 5 log? n| > w(n) log?/? n) = 0. 
noo 


Thus the “typical” order is about 5 log? n. 
Erdés and Turan went on to prove that, asymptotically, O(7,) has a 
log-normal distribution: as n — 00, 


= 1 
V3 (log O(a) — > log? n) / log?/? n 
tends, in distribution, to the standard normal distribution, i.e. if 2 € R then 


— O(ttn) — $ log? ”) Leis 


lim P 
log?/? n 


n—->oco 


Having established this central limit theorem, which by now is known 
as the Erdés-Turdn law, they went on to study the number W(n) of 
different values of O(7,,). (Thus W(n) is the number of non-isomorphic cyclic 
subgroups of S;,.) Erdés and Turdén proved that 


n log | 
W(n) = exp 4 74/ zl + O viniogloen ; 
3logn logn 


and, with the exception of o(W(n)) values, all are of the form 


exp {1 + x) See? ean : 


In S,, there are p(n) conjugacy classes, where p(n) is the partition function 
mentioned earlier, and studied in detail by Hardy and Ramanujan. As the 
order of a permutation a € S$, depends only on its conjugacy class K, it is 
natural to ask what the distribution of O(K) is if the p(n) conjugacy classes 
are considered equiprobable. Here we have written O(K) for the order of any 
permutation in K. Erdés and Turan proved that, with probability tending 
to 1, 


O(K) = exp((Ao + o(1)) Vn), 


where 


= 1.81. 


ics ESS (—1)7+1 
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All these results are proved by hard analysis, using Tauberian theorems 
and contour integration, somewhat resembling the Hardy-Ramanujan analy- 
sis; there is no reference to soft analysis or general theorems in probability 
theory or group theory that would get round the hard work. Thus it is not 
surprising that, over the years, many of the results of Erdés and Turan have 
been given shorter, more probabilistic proofs, that lead to sharper results. 
In particular, the Erdés-Turan law was studied by Best in 1970, Bovey in 
1980, Nicolas in 1985 and Arratia and Tavaré in 1992. To date, the last word 
on the topic is due to Barbour and Tavaré, who used the Ewens sampling 
formula, derived by Ewens in 1972 in the context of population genetics, to 
give a beautiful proof of the Erdés-Turan law with a sharp error estimate. It 
is fascinating that, in order to get a small error term, Barbour and Tavaré 
had to adjust slightly the approximating normal distribution: 


sup, |[P[{4 log® n}~1/? (log O(a) — 5 log? n+ log nloglogn) < a] — ®(x)| 
= O((logn)~*/?). 


Numerous other problems of statistical group theory have been studied, 
including the problem of random generation. Dixon proved in 1969 that 
almost all pairs of elements of S, generate S, or the alternating group 
A,, and recently Kantor and Lubotzky proved analogues of this result for 
finite classical groups. Because of problems arising in computational Galois 
theory, one is also interested in a considerably stronger condition than mere 


generation. The elements 21,...,% of a group G are said to generate G 
invariably if G is generated by yi,...,Y%m whenever y; is conjugate to 2;. for 
i = 1,2,...,m. Dixon showed in 1992 that for some constant c > 0, with 


probability tending to 1, c(logn)!/? randomly chosen permutations generate 


Sy, invariably. In 1993 Luczak and Pyber, confirming a conjecture of McKay, 
proved that for every « > 0 there is a constant C = C(e) such that C random 
elements generate S, with probability at least 1—. 

Luczak and Pyber also proved a conjecture of Cameron; they showed that 
the fraction of elements of S, that belong to transitive subgroups other than 
S, or Ap tends to 0 as n— oo. 

Needless to say, in spite of these powerful results, many important 
questions remain unanswered, indicating that statistical group theory is still 
in its infancy. 

When writing about the contributions of Erddés to mathematics, it 
would be unforgivable not to emphasize the enormous influence he exerts 
through his uncountably many problems. At the International Congress 
of Mathematicians in Paris in 1900, David Hilbert emphasized with great 
eloquence the importance of problems for mathematics. “The clearness and 
ease of comprehension insisted on for a mathematical theory I should still 
more demand for a mathematical problem, if it is to be perfect. For what is 
clear and easily comprehended attracts; the complicated repels us.” 
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For lack of space, we shall confine ourselves to one more of the problems 
of Erdés that have been solved, and to three particularly beautiful unsolved 
questions. 

There is no doubt that the most difficult Erdés problem solved to date 
is the problem on arithmetic progressions. In 1927 van der Waerden proved 
the following conjecture of Baudet: if the natural numbers are partitioned into 
two classes then at least one of the classes contains arbitrarily long arithmetic 
progressions. Over the years, this beautiful Ramsey-type result has been the 
starting point of much research. Quite early on, in 1936, Erdés and Turan 
suspected that partitioning the integers is an overkill: it suffices to take a 
“large” set of integers. Thus they formulated the following conjecture: if A 
is a set of natural numbers with positive upper density, that is, of 


|A9 [nll 


lim sup > 0, 


noo 
then A contains arbitrarily long arithmetic progressions. 

Roth was the first to put a dent in this Erdos-Turadn conjecture when, 
in 1952, he proved that A must contain arithmetic progressions of length 
3. Length 4 was much harder: Szemerédi proved it only in 1969. Having 
warmed up on length 4, in 1974 Szemerédi proved the full conjecture; the 
long and difficult proof is a real tour de force of combinatorics. The story 
did not end there: in 1977 Fiirstenberg gave another proof of Szemerédi’s 
theorem, using tools of ergodic theory; the methods of this proof and the 
new problems it naturally led to revolutionized ergodic theory. 

Let us turn then to the three unsolved Erdés problems we promised. The 
first asks for a substantial extension of Szemerédi’s theorem. Let a; < ag <... 
be a sequence of natural numbers such that 5> 1/a, = oo. Is it true then that 
the sequence contains arbitrarily long arithmetic progressions? It is not even 
known that Roth’s theorem holds in this case, i.e., that the sequence contains 
an arithmetic progression with three terms. If this is not enough to indicate 
that this problem is rather hard, it is worth noting that Erdés offers $5,000 for 
a solution. A rather special case of the conjecture would be that the primes 
contain arbitrarily long arithmetic progressions. 

The last two are also rather old conjectures, but each carries “only” a $500 
price-tag. Let f(n) be the minimal number of distinct distances determined 
by n distinct points in the plane. Erdés conjectured in 1946 that 

cn 


for some absolute constant c > O. The lattice points show that, if true, this 
is best possible. Chung, Szemerédi and Trotter have proved that f(n) is at 
least about n4/>, 

The third problem is from the 1961 paper of Erdés, Ko and Rado; it is, 
in fact, the last unsolved problem of that paper. (However, Ahlswede and 
Khachatrian have just announced a proof of the conjecture.) 
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Let A be a 2-intersecting family of 2n-subsets of [4n] = {1,2,...,4n}. 
Thus if A,B € Athen A,B C [4n], |A] = |B] = 2n, and |AN B| > 2. Then 
the Erdés-Ko-Rado conjecture states that 


2 
asi(2)-¥(0)" 
2\2n 2\n 

It is easily seen that, if true, this inequality is best possible. Indeed, let A 
be the collection of 2n-subsets of [4n], containing at least n+ 1 of the first 2n 
natural numbers. Then A is clearly 2-intersecting, and for every 2n-subset A 
on [4n], the system contains precisely one of A and its complement A, unless 
A (and so A as well) contains precisely n of the first 2n natural numbers. 

It is widely known that vast amounts of thought and ingenuity are 
required in order to earn $500 on an Erdés problem; even so, this problem 
may be far harder than its price-tag suggests. 

Although this brief review does not come close to doing justice to 
the mathematics of Paul Erdos, it does indicate that he has enriched the 
mathematics of this century as very few others have. He has clearly earned 
a mathematical Oscar for lifetime achievement, several times over. May he 
continue to prove and conjecture for many years to come. 


Added in Proof 


Sadly, this was not to be. On 20 September 1996, while attending a mini- 
semester at the Banach Center in Warsaw, Professor Paul Erdés was killed 
by a massive heart attack. Although in the last year he started to show signs 
of aging, his death was premature and entirely unexpected. 

We combinatorialists have just become orphans. 


B.B. 
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If I have seen further, it is by standing on the shoulders of Hungarians— 
Peter Winkler 


Paul Erdés was a giant of twentieth century mathematics. Born in 1913 he 
was a child prodigy whose talents were well cultivated in his native Budapest. 
Dubbed Der Zauberer von Budapest he moved onto the world stage in his 
early 20s. And there he stayed, extraordinarily active right until his death in 
1996. 

In 1999 when his longtime friend and collaborator Vera Sés organized 
a memorial conference we participants were stunned. Komjath on infinite 
graphs, Pomerance on special arithmetic functions, Hajnal on partition 
relations, Graham on Ramsey theory, Simonovits on Extremal Graph theory, 
Lubinsky on interpolation, Alon and this author on probabilistic methods— 
none of us individually had realized the breadth of Erdés’s contributions. 

Paul’s place in the mathematical pantheon will be a matter of strong 
debate for in that rarefied atmosphere he had a unique style. The late Ernst 
Straus described this style in a commemoration of Erdés’s 70th birthday. 


In our century, in which mathematics is so strongly dominated by 
“theory constructors” he has remained the prince of problem solvers 
and the absolute monarch of problem posers. One of my friends - a 
great mathematician in his own right - complained to me that “Erdés 
only gives us corollaries of the great metatheorems which remain 
unformulated in the back of his mind.” I think there is much truth 
to that observation but I don’t agree that it would have been either 
feasible or desirable for Erdés to stop producing corollaries and 
concentrate on the formulation of his metatheorems. In many ways 
Paul Erddés is the Euler of our times. Just as the “special” problems 
that Euler solved pointed the way to analytic and algebraic number 
theory, topology, combinatorics, function spaces, etc.; so the methods 
and results of Erdés’s work already let us see the outline of great new 
disciplines, such as combinatorial and probabilistic number theory, 
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combinatorial geometry, probabilistic and transfinite combinatorics 
and graph theory, as well as many more yet to arise from his ideas. 


Straus, who worked as an assistant to Albert Einstein, noted that Einstein 
chose physics over mathematics because he feared that one would waste 
one’s powers in pursuing the many beautiful and attractive questions of 
mathematics without finding the central questions. Straus goes on, 


Erdés has consistently and_ successfully violated every one of 
Einstein’s prescriptions. He has succumbed to the seduction of every 
beautiful problem he has encountered—and a great many have succumbed 
to him. This just proves to me that in the search for truth there is room 
for Don Juans like Erdés and Sir Galahad’s like Einstein. 


Tim Gowers, in his beautiful and influential essay “The Two Cultures 
of Mathematics” discusses this distinction between theory constructors and 
problem solvers in detail. He gives Erdés as the archetypal problem solver 
and gives a spirited defense of the problem solver mode of doing mathematics. 
He worries that such a defense is necessary: 

... mathematicians in the theory-building areas often regard what they are 

doing as the central core (Atiyah uses this exact phrase) of mathematics, 


with subjects such as combinatorics thought of as peripheral and not 
particularly relevant to the main aims of mathematics. 


Today, however, discrete math is a respected area of mathematics. Gowers 
himself was awarded the Fields Medal in 1998, Terence Tao was awarded 
the Fields Medal in 2006, Laszl6 Lovasz was awarded the Kyoto Prize in 
2010, all for work with a strong combinatorial component. In 2012 the 
Abel Prize was awarded to Endre Szemerédi. Szemerédi’s work is almost 
entirely combinatorial and follows closely in the footsteps of Paul Erdés. 
This respect certainly did not exist a 100, even 50 years ago. Discrete Math 
was often dismissed as “puzzle math” and the phrase “the slums of topology” 
(sometimes attributed to J. H. C. Whitehead) was widely and disparagingly 
used to describe it. 

The rise of the Discrete has, I feel, two main causes. The first was The 
Computer, how wonderful that this physical object has led to such intriguing 
mathematical questions. The second, with due respect to the many others, 
was the constant attention of Paul Erdés with his famous admonition “Prove 
and Conjecture!” Ramsey Theory, Extremal Graph Theory, Random Graphs, 
how many turrets in our mathematical castle were built one brick at a 
time with Paul’s theorems and, equally important, his frequent and always 
penetrating conjectures. 

My own research specialty, The Probabilistic Method, owes its existence 
to Paul Erdos. I have proposed the term Erdés Magic for the area as a 
tribute to Erdos. It began in 1947 with a 3 page paper in the Bulletin of 
the American Mathematical Society. Paul proved the existence of a graph 
having certain Ramsey property without actually constructing it. In modern 
language he showed that an appropriately defined random graph would have 


Erdés Magic 45 


the property with positive probability and hence there must exist a graph 
with the property. For the next 20 years Paul was a “voice in the wilderness” , 
his colleagues admired his amazing results but adoption of the methodology 
was slow. But Paul persevered—he was always driven by his personal sense of 
mathematical aesthetics in which he had supreme confidence—and today the 
method is widely used in both Discrete Math and in Theoretical Computer 
Science. 

My own introduction to Erdos was not atypical. I had managed to 
solve one of his ten dollar problems. Erdds would give such small monetary 
awards for solutions of his conjectures. To receive such an award was heaven 
incarnate. We met and I nervously explained my argument. Instantly he saw 
how my methods could be used on a very different problem. This became 
our first joint paper and I became one of his disciples. This occurred in the 
late 1960s, a tumultuous time when “do your own thing” was the admonition 
that resonated so powerfully. But while others spoke of it, this was Paul’s 
modus operandi. He had no job; he worked constantly. He had no home; the 
world was his home. Possessions were a nuisance, money a bore. He lived 
on a web of trust, traveling ceaselessly from Center to Center, spreading his 
mathematical pollen. 

What drew so many of us into his circle? What explains the joy we have 
in speaking of this gentle man? Why do we love to tell Erd6s stories? I’ve 
thought a great deal about this and I think it comes down to a matter of belief, 
or faith. We know the beauties of mathematics and we hold a belief in its 
transcendent quality. In Kronecker’s immortal words: Die ganzen Zahlen hat 
der liebe Gott gemacht, alles andere ist Menschenwerk. Mathematical truth 
is immutable, it lies outside physical reality. When we show, for example, that 
two n-th powers never add to an n-th power for n > 3 we have discovered a 
Truth. This is our belief, this is our core motivating force. Yet our attempts to 
describe this belief to our nonmathematical friends is akin to describing the 
Almighty to an atheist. Paul embodied this belief in mathematical truth. 
His enormous talents and energies were given entirely to the Temple of 
Mathematics. He harbored no doubts about the importance, the absoluteness, 
of his quest. To see his faith was to be given faith. The religious world has a 
name for such people—they are called saints. We knew him as Uncle Paul. 

I do hope that one cornerstone of Paul’s, if you will, theology will long 
survive. I refer to The Book. The Book consists of all the theorems of 
mathematics. For each theorem there is in The Book just one proof. It is the 
most aesthetic proof, the most insightful proof, what Paul called The Book 
Proof. When one of Paul’s myriad conjectures was resolved in an “ugly” way 
Paul would be very happy in congratulating the prover but would add, “Now, 
let’s look for The Book Proof.” This platonic ideal spoke strongly to those of 
us in his circle. The mathematics was there, we had only to discover it. 

The intensity and the selflessness of the search for truth were described by 
the writer Jorge Luis Borges in his story La biblioteca de Babel. The narrator 
is a worker in a library that contains on its infinite shelves all wisdom. He 
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wanders its infinite corridors in search of what Paul Erdés might have called 
The Book. He cries out, 


To me, it does not seem unlikely that on some shelf of the universe there 
lies a total book. I pray the unknown gods that some man—even if only 
one man, and though it have been thousands of years ago!—may have 
examined and read it. If honor and wisdom and happiness are not for 
me, let them be for others. May heaven exist though my place be in hell. 
Let me be outraged and annihilated but may Thy enormous Library be 
justified, for one instant, in one being. 


In the summer of 1985 I drove Paul to Yellow Pig Camp—a mathematics 
camp for talented high school students. It was a beautiful day—the students 
loved Uncle Paul and Paul enjoyed nothing more than the company of eager 
young minds. In my introduction to his lecture I discussed The Book but I 
made the mistake of describing it as being “held by God.” Paul began his 
lecture with a gentle correction that I shall never forget. “You don’t have to 
believe in God,” he said, “but you should believe in The Book.” 


I. Early Days 


Introduction 


We do not have much to add here. We only want to express our gratitude 
to Arthur Stone, Cedric Smith, William Tutte and Irving Kaplansky for 
their personal recollections of those days when many contemporary theories 
were being created. In those early days, Erdés’ interests were almost entirely 
devoted to number theory and, as seen from his own contribution, as well 
as from the other contributions to this chapter, number theory is an old 
love which does not fade. But so is an early combinatorial puzzle solved by 
four young Cambridge undergraduates (three of which are among authors 
of this chapter). C. Smith related this puzzle to P. Erdés while W. Tutte, 
in his characteristic style, connects his very recent research to his own early 
beginnings. 

But perhaps it is very fitting to include two quotations by two of Erdés’ 
closest and earliest collaborators and colleagues from their own recollections 
of the early days in Budapest in the 1930s. 
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I have known Paul Erdés long enough to be a bit personal. We met 
daily at the university, made excursions practically every Sunday 
with a group of other fellow students, steady members of which were 
(among others) Gy. Szekeres and T. Gallai. The main subject of 
conversations was mathematics; since Gallai and Erdés attended the 
graph theoretical lectures of D. Konig, graph theory was discussed 
often. The first graph-theoretical result of Erdés, an extension of 
Menger’s theorem to infinite graphs, arose as early as 1931; it was 
an answer to a question of Konig a few weeks after Konig raised it 
in his course. This was published in K6nig’s classical book in 1936 
(probably nowhere else)... 


Erdos’ main interest in the thirties was number theory but gradually 
more and more combinatorial moments occurred even in these works. 
As a clear sign of this I quote from a paper of his entitled “On 
sequences of integers no one of which divides the product of two 
others and on some related problems” which appeared in 1938: “The 
argument was really based upon the following theorem for graphs. 
Let 2k points be given. We split them into two classes each containing 
k of them. The points of two classes are connected by segments such 
that the segments form no closed quadrilateral. Then the number 
of segments is less than 3k°/?.” No doubt he was not far from the 
discovery of extremal graph theory before 1938. 


P. Turan (Art of Counting, p. xvii) 


We had a very close circle of young mathematicians, foremost among 
them Erdés, Turan and Gallai; friendships were forged which became 
the most lasting that I have ever known and which outlived the 
upheavals of the thirties, a vicious world war and our scattering to 
the four corners of the world. I myself was an “outsider,” studying 
chemical engineering at the Technical University, but often joined the 
mathematicians at weekend excursions in the charming hill country 
around Budapest and (in summer) at open air meetings on the 
benches of the city park. 


Paul, then still a young student but already with a few victories in 
his bag, was always full of problems and his sayings were already a 
legend. He used to address us in the same fashion as we would sign 
our names under an article and this habit became universal among 
us; even today I often call old members of the circle by a distortion 
of their initials. 


“Szekeres Gy., open up your wise mind.” This was Paul’s customary 
invitation—or was it an order?—to listen to a proof or a problem of 
his. Our discussions centered around mathematics, personal gossip, 
and politics. It was the beginning of a desperate era in Europe. Most 
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of us in the circle belonged to that singular ethnic group of European 
society which drew its cultural heritage from Heinrich Heine and 
Gustav Mahler, Karl Marx and Cantor, Einstein and Freud, later to 
become the principal target of Hitler’s fury. 


G. Szekeres (Art of Computing, p. xix, xx) 


And we are happy to include here a standard Erdés article of those days. 
Well slightly nonstandard this time: My favorite problems from all fields. . . 


Some of My Favorite Problems and Results 


Paul Erdés 


P. Erdés (Deceased) 
Mathematical Institute, Hungarian Academy of Sciences, Budapest Pf. 127, 
1364, Hungary 


1. Introduction 


Problems have always been an essential part of my mathematical life. A well 
chosen problem can isolate an essential difficulty in a particular area, serving 
as a benchmark against which progress in this area can be measured. An 
innocent looking problem often gives no hint as to its true nature. It might 
be like a “marshmallow,” serving as a tasty tidbit supplying a few moments 
of fleeting enjoyment. Or it might be like an “acorn,” requiring deep and 
subtle new insights from which a mighty oak can develop. 

As an illustration of how hard it can be to judge the difficulty of a problem, 
I’d like to tell the following anecdote concerning the great mathematician 
David Hilbert. Hilbert lectured in the early 1920s on problems in mathematics 
and said something like this—probably all of us will see the proof of the 
Riemann Hypothesis, some of us (but probably not I) will see a proof 
of Fermat’s last theorem but none of us will see the proof that QV? is 
transcendental. In the audience was Carl Ludwig Siegel, whose deep research 
contributed decisively to the proof by Kusmin a few years later of the 
transcendence of 2V2. In fact, shortly thereafter Gelfond and a few weeks 
later Schneider independently proved the a® is transcendental if a and @ are 
algebraic, ( is irrational and a ¥ 0,1. 

In this note I would like to describe a variety of my problems which I would 
classify as my favorites. Of course, I can’t guarantee that they are all “acorns,” 
but because many have thwarted the efforts of the best mathematicians for 
many decades (and have often acquired a cash reward for their solutions), it 
may indicate that new ideas will be needed, which can in turn, lead to more 
general results, and naturally, to further new problems. In this way, the cycle 
of life in mathematics continues forever. 


2. Number Theory 


My first serious problem was formulated in 1931 and it is still wide open. 
Denote by f(n) the largest number of integers 1 < ay < ag <-+-< agp <n 
k 


all of whose subset sums 57> €;a; are distinct, where €; = 0 or 1. The powers 
i=l 
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of 2 have this property and I conjectured that 


oe logn 


1 
oe ‘) 


for some absolute constant c. I offer $500 for a proof or disproof. The 
inequality 


logn  loglogn 


f(n) < 


log 2 log 2 = 


is almost immediate, since there are 2” sums of the form >>, ia; and they 
must be all distinct and all are less than kn. In 1954, Leo Moser and I proved 
by using the second moment method that 


f(n) < 


which is the current best upper bound. 
Conway and Guy found 24 integers all < 2? for which all subset sums 
are distinct, which implies f(2") > n+ 2 for n > 22. Perhaps 


f(2") <n+2- for all n? 


logn | loglogn | 
' 2log2 ° a 


log 2 


Perhaps the following variant of the problem is more suitable for computation. 
Let 1 < by < bg <--+ < bm be a sequence of integers for which all the subset 
m 


sums )> €,b;, €; = 0 or 1, are distinct. Is it true that 
i=l 


minb,, > 2™~° (2) 


for some absolute constant c? Inequality (2) is of course equivalent to (1). 
The determination of the exact value of minb,, is perhaps hopeless but for 
small m the value of minb,,, can no doubt be determined by computation, 
and I think this would be of some interest. 

Perhaps my favorite problem of all concerns covering congruences. It was 
really surprising that it had not been asked before. A system of congruences 


a; (mod nj), ny <ng<...< Mp (3) 


is called a covering system if every integer satisfies at least one of the 
congruences in (3). The simplest covering system is 0 (mod 2), 0 (mod 3), 1 
(mod 4), 5 (mod 6) and 7 (mod 12). The main problem is: Is it true that for 
every c one can find a covering system all of whose moduli are larger than c? 
I offer $1,000 for a proof or disproof. 

Choi [2] found a covering system with nj = 20, and a Japanese 
mathematician whose name I do not remember found such a system with 
n, = 24. If the answer to this question is positive, denote by f(n) the smallest 
integer k for which there is a covering system 


a; (mod nj), 1<i<k, m=t, k= f(t). 
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It would be of some mild interest to determine f(t) for the few values of 
t =n, for which we know that covering systems exist. 

Many further unsolved problems can be asked about covering systems. 
Selfridge and I asked: Is there a covering system all of whose moduli are odd? 
Schinzel asked: Is there a covering system where n; { n; for i 4 7? Schinzel 
used such covering systems for the study of irreducibility of polynomials. 
Related to this is a question of Herzog and Schénheim: Does there exist a 
finite group which can be partitioned into cosets of different sizes? 

More generally, let ny < ng < --- be a sequence of integers. Is there 
a reasonable condition which would imply that there is a covering system 
whose moduli are among the n;? Quite likely there is no such condition. 
Let us now drop the condition that the set of moduli is finite, but to avoid 
triviality we insist that in the congruence a; (mod n,), only the integers > n; 
are considered. When if ever can we find such a system? 

Perhaps it is of some interest to tell how I came upon the problem of 
covering congruences. In 1934 Romanoff [20] proved that the lower density 
of integers of the form 2* + p, with p prime, is positive. This was surprising 
since the number of sums 2" +p < x is cz. Romanoff in a letter in 1934 asked 
me if there were infinitely many odd numbers not of the form 2* + p. Using 
covering congruences I proved in [8] there is an infinite arithmetic progression 
of odd numbers no term of which is of the form 2" + p. Independently, Van 
der Corput also proved that there are infinitely many odd numbers not of 
the form 2* + p. In [3] Crocker proved there are infinitely many odd numbers 
not of the form 2" + 2*2 + p, but his proof only gives the number of integers 
< x which are not of this form is > cloglogx. This surely can be improved 
but I am not at all sure if the upper density of the integers not of the form 
2'1 + 9*2 + » is positive. One could ask the following (probably unattackable) 
problem: Is it true that there is an r so that every integer is the sum of a 
prime and at most r powers of 2? Gallagher [14] proved (improving a result 
of Linnik) that for every € there is an rg so that the lower density of the 
integers which are the sum of a prime and r, powers of 2 is at least 1 —e. No 
doubt, lower density always could be replaced by density, but a proof that 
the density of the integers of the form 2" + p exists seems untouchable. 

I think that every arithmetic progression contains infinitely many integers 
of the form 2*: + 2'2 + p. Thus, covering congruences cannot be used to 
improve the result of Crocker. 

On diophantine equations, my most important paper is my result with 
Selfridge: The product of consecutive integers is never a power. Perhaps in 


k 
fact for k > 3 the product [[(n +7), n > k, always has a prime factor 
i=l 
k 
p > k for which p? { [] (n +7). Unfortunately, this seems hopeless at present. 
i=l 


Perhaps the product [[ (n+ jk), m > 3, (ie., the product of more than 
j=l 
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3 terms of an arithmetic progression) is never a power. Ramachandra, Shorey 
and Tijdeman have important results in this direction (see the paper “Some 
methods of Erd6és applied to finite arithmetic progressions” by Shorey and 
Tijdeman in this volume). 

The following Tauberian theorem is connected to the elementary proof of 
the prime number theorem by Selberg and myself. Let ax > 0, s; = ee Qj 
and suppose ~ 


So an(k + 8n—z) =n? + O(n). (4) 
k=1 


Then s, =n+O(1). 
This result is clearly related to Selberg’s fundamental inequality 


S| (log p)? + y log plog g = 2x log x + O(z). 


p<ax pq<«x 


My original proof of (4) was very complicated. It was simplified by Siegel and 
later by Shapiro. I am certain further results of this type can be obtained. 
Let me now move on to some problems which originally arose in con- 
nection with van der Waerden’s classic theorem on arithmetic progressions. 
Nearly 80 years ago, Schur conjectured that if we color the integers with 
£ colors, there is always formed a monochromatic arithmetic progression of 
k terms. In 1927, van der Waerden found an ingenious proof, and, in fact, 
he proved that there is a least integer W(k, 2) such that for every partition 
of the integers 1 < t < W(k, £) into @ classes, there is always an arithmetic 
progression of k terms belonging to a single class. Let W(k) denote W(k, 2). 
The proof given by van der Waerden gave a very poor estimate for W(k), e.g., 
it was not even primitive recursive. It was a great achievement a few years ago 
when Shelah gave a primitive recursive bound for W(k). Probably, his bound 
is still much too large and perhaps W(k) < 22". The first nontrivial lower 
bound W(k) > 2/2 was given by Rado and myself. The current best bound 
is W(p+1) > p- 2°, p prime, due to Berlekamp. I believe lim,_-,.. is) =o 
has been proved by Beck (but this is not yet published). The first task would 
be to prove that W(k) > (2+¢)*, and perhaps W(k)!/* — oo. Graham offers 
v4 


$1,000 to prove W(k) <2? (where there are k 2’s in the tower) for all k. 
Over 60 years ago, Turan and I thought that this was a Turan type of 
problem and not a Ramsey type (this terminology did not exist at the time!). 
In fact, let r~(n) be the smallest integer for which every sequence of integers 
1<ay <ag<-++ <a: <nwitht > r,(n) contains an arithmetic progression 


rp(n) 


of k terms. It is easy to see that limn—+o0 exists for any k. We conjectured 
that tat) —+ 0 as n > oo. This of course would imply van der Waerden’s 


theorem. We did not at first realize the difficulty of our conjecture and hoped 
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that one might be able to get better estimates for W(k) this way. At first we 
thought r;,(n) < n'~® but 50 years ago, R. Salem and D. C. Spencer proved 


r3(n) > ni eloglogn 
and in 1946, Behrend proved 
r3(n) > nexp(—cy/logn) 
which is the current record. Forty years ago Roth proved 
r3(n) > en/ log log n. 
The current record is due to Heath-Brown and Szemerédi: 
r3(n) <n/(logn)*, ax 1/4. 


I offer $500 for a proof that r3(n) < n/(logn)° for every c, and $1,000 for 
any asymptotic formula for r;,(n). This is probably unattackable at present. 
In the early 1970s I offered $1,000 for a proof that r,(n)/n + 0 as n > oo 
and this was accomplished by Szemerédi in 1974. His proof is a masterpiece 
of combinatorial reasoning, and his regularity lemma, introduced in an early 
form in his proof, has subsequently found many applications in combinatorics 
and graph theory. Unfortunately, his proof used van der Waerden’s theorem 
and so could not be used for the estimation of W(k). Furstenberg also 
proved rz(n)/n + 0 as n + oo by using ergodic theory, and his methods 
already have many applications to various problems in combinatorial number 
theory. In fact, there is a growing number of results in combinatorial number 
theory which only can be proved by methods of ergodic theory (so far). 
Unfortunately, Furstenberg’s methods do not help in estimating W(k). 

An old conjecture in number theory states that for every k there are k 
primes in an arithmetic progression. This problem seems unattackable; the 
largest currently known arithmetic progression of primes has 22 terms. Many 
years ago I made the following conjecture which, if true, would settle the 


problem: Let a; < az <-+-- be a sequence of integers satisfying 
Co 
1 
hai * 


Then the a,’s contain arbitrarily long arithmetic progressions. I offer $5,000 
for a proof (or disproof) of this. Neither Szemerédi nor Furstenberg’s methods 
are able to settle this but perhaps the next century will see its resolution. 

Perhaps for every € > 0 there is an no(e€,k) so that if no(e,k) < ar < 
ag <--+ is a sequence of integers which does not contain a k-term arithmetic 
progression then }7; + <. 

Here is a related problem which is completely hopeless at present. Is it 
true that for every & there are k consecutive primes in arithmetic progression? 
In the present state of science, the problem is unattackable even for k = 3. 
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It is well known that there are infinitely many triples of primes forming an 
arithmetic progression but it is not yet known if this holds for quadruples. 


Let a, < ag < -:- be an infinite sequence of integers which does not 
contain a k-term arithmetic progression. Is it true that 
1 1 
So = < (= +0(1)) nW(k)? (6) 
A ay 2 


If true, (6) is trivially best possible, and would be a considerable strengthen- 
ing of (5). The inequality 


1 
S° — <clnW(k) 
—~ Ay 
a 

would already be a sensational result. 
I should point out that it is very difficult to determine exact values of 


W(k). So far, we only know 
W(3)=9, W/(4)=35, W(5)=178. 


W (6) is certainly unknown and perhaps beyond the range of computers. For 
more complete references on these topics, the reader can consult the books 
of Graham, Rothschild, and Spencer [15], NeSetiil and Rédl [19], Erdés and 
Graham [12], and Guy [16]. 

I now move on to Sidon sequences and related problems. Let A = {a, < 
a2 <---} bea finite or infinite sequence of integers, and let 


A(z) = S01. 


ay<u 


Denote by f(n) the number of solutions of n = a; + a;. I first met Sidon in 
1932. He posed two problems. The first was this: Does there exist an infinite 
sequence A for which for all n > no, f(n) > 0, but so that for every € > 0, 
f(n)/n® + 0 as n — oo. In other words, A should be a basis of order 2 but the 
number of solutions of n = a; + a; should be small. I liked the problem very 
much and optimistically assured Sidon that I would construct such a sequence 
in a few days. In fact, it took me 20 years to prove there is a sequence A for 
which 


clogn < f(n) < cglogn. (7) 


This is of course much stronger than what Sidon asked for, but I never have 
found a constructive proof of (7), or for that matter, even for Sidon’s original 


question. My proof shows that if you select a random sequence A in which 
each n is put in A with probability e( mee? 
(7). 


Turdn and I conjectured that if f(n) > O for n > no then 
limsup,_,o {(n) = oo. I offer $500 for a proof or disproof of this. I also 


then almost all such A satisfy 
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offer $100 for a constructive proof of Sidon’s original question. I conjecture 
there is no sequence A for which 


f(n)/logn > ¢, 0<c<oo. (8) 


In other words, (7) is nearly best possible. I offer $500 for a proof or disproof 
of (8). Sdrkézy and I proved that 


|f(n) — clog n| 
(log n)'72 


is not possible for any c, which is much weaker than (8). Perhaps an even 
stronger result than (8) holds. Put 


f(r) 


c, = liminf ——, c2 = limsup 
logn 


+0 (9) 


f(r) 

log n’ 

Then there is an absolute constant c so that for all A, co -—c, > «. 
This conjecture may be a bit too optimistic but I could never find a 
counterexample. 

Now let us discuss Sidon’s second question. Find a sequence A for which 
A(z) is as large as possible and for which f(n) = 0 or 1, ie., the sums a;+a,, 
i < J, are all distinct. Such sequences are now called Sidon sequences. Sidon 
was led to these problems by his study of lacunary trigonometric series. The 
greedy algorithm easily gives an infinite Sidon sequence A which for every x 
satisfies 


A(a) > ca¥/3, (10) 


It took about 50 years until Ajtai, Komlds and Szemerédi (finally) improved 
(10). They showed it is possible to have 


A(x) > c(xlog x)'/3, (11) 


Probably there is a Sidon sequence A for which A(a) > 2!/?-® but this at 
present is far beyond reach. Rényi and I proved that for every € > 0 there is a 
sequence A with A(x) > n!/2-® for which f(n) < c(e). I proved lim sup Aw = 


s which was strengthened by Kriickerberg to lim sup Aw > Zs but the truth 


is surely 1. Does there exist such a sequence with A(x) > x!/?/(loga)°? A 
sharpening of our old conjecture with Turdn would state: If a, < Cn? for 
all n then limsup f(n) = co. In fact, for what functions t(n) + oo does 
dy < n7t(n) imply limsup,,_,,, f(n) = co. 

Here is an old conjecture of mine: Let a; < a2 < --- be an infinite 
sequence for which all the triple sums a; +a; +a, are distinct: Is it then true 
that lim sup a,/ n°? = oo. I offer $500 for a proof or disproof of this. Turan and 
I observed that Sidon sequences behave quite differently. We proved that if 
ay < ag <-+++ <a, <nisa Sidon sequence, and we denote by S(n) = maxk, 
then 


S(n) <n? 4 en4, (12) 
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Chowla and I noticed that the construction of Singer for perfect difference 
sets immediately gives 


S(n) > n¥/? — ni/2-€, (13) 
Tam fairly sure that 

S(n) = n'/? + O(n®). (14) 
for every € > 0, and it might even be true that 

S(n) = n'/? + O(1). (15) 


I offer $1,000 for clearing up these two problems. Incidentally, Singer proved 
that there are p+ 1 residues a, a@2,...,@p41 (mod p* +p +1) (where p is 
a prime power) so that every nonzero residue t (mod p? + p+ 1) can be 
expressed uniquely in the form t = a;—a; (mod p?+p+1),1<i,j<p+l. 
This beautiful result easily gives (13). 

Given an infinite sequence S = a, <--- <a, <--- of positive integers, 
denote by R,(S') = R, the number of representations of the integer n in the 
form n = a;+a;. My result with Fuchs states that for any positive constant c, 
the relation 

N 

YR =cN +0(N/? log? N) 

n=0 
cannot hold (irrespective of the nature of S). I hope that the Erdés-Fuchs 
result will survive us by centuries. We proved our result when I visited 
Cornell in the summer of 1954 and the paper appeared in 1956. Fuchs 
also considered our result important. Montgomery and Vaughan now have 
a somewhat sharper result. 

Many years ago, I conjectured that every finite Sidon sequence a; < --- 
< a; can be completed into a perfect difference set of Singer. In other words, 
there is a p = q® and a Singer set 


Qy <2 < +++ <a < Gey < ++ <Gpy1 <p? +ptl 


which is a perfect difference set for p? + p +1. I now feel this conjecture is 
perhaps too optimistic and I would be very happy for a proof of the following 
weaker conjecture: Let a, < ag < --+ < a be a Sidon sequence. Then for 
every € > 0 there is a Sidon sequence a, < +--+ < ay < Qi41 < +++ < Gy for 
which a, < (1+ )n?. If true, this conjecture would imply that there is a 
Sidon sequence for which 

-  . 6 An 

lim. inf as 1. (16) 
In view of (12), this would be best possible. 

I proved that for every Sidon sequence, lim inf; 4. @n/n? = oo, and in 

fact, more precisely, 
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an 
li 0. 17 
poms ne nea > ( ) 
The reader will notice that (11) and (17) are very far apart. I conjecture 
that (11) and probably (17) can be improved a great deal. I expect there is 
a Sidon sequence so that for every € > 0 and n > no, 


é,2n", (18) 


I offer $1,000 for a proof or disproof of (18). 

Sarkozy and I recently conjectured that if a, < ag <--- andbi < bo <-:-- 
are two infinite sequences for which a,,/b, — 1, and if g(n) denotes the 
number of solutions of n = a; + b;, then if g(n) > 0 for all n then 

lim sup g(n) = oo. (19) 

noo 

If true, (19) would greatly strengthen our old conjecture with Turan: f(n) > 0 
for alln = limsup,,_,,, f(m) = oo. It is not even clear to me that this implies 
f(n) > 2 for infinitely many n. Our conjecture is perhaps too optimistic. 
Observe that the condition ag — 1 cannot be replaced by c1 < an/bn < C2. 
To see this, let the a;,’s be the integers of the form }>, ¢;2”’, €; = 0 or 1, and 
let the b;,’s be the integers of the form )>, €;27’+1. Clearly, every n can be 
uniquely expressed in the form a; + b;. Perhaps if € > 0 is sufficiently small 
then 


1—€<ay/b, <1+e 


and f(n) > 0 will imply limsup f(n) = co. (However, this conjecture also 
seems too optimistic.) 

Let me conclude this topic with one more conjecture. Let aj < a2 <-:-, 
be an infinite sequence of integers, and let h(n) denote the number of 
integers not exceeding n of the form a; + aj, Suppose that for every € > 0, 
h(n)/n+~® — 00 as n + oo. Is it then true that limsup f(n) = 00? Perhaps 
even this conjecture is too much to ask for and one instead should first look 
for a counterexample. For more complete references and many more results 
in this area, the reader should consult the excellent book of Halberstam and 
Roth [17]. (I understand a new and greatly enlarged edition is in preparation.) 

I conjectured long ago that if f(m) = +1 then for all c there is a d such 
that 


max > 1ka) >c. 
k=1 
In fact, perhaps 
£ 
max Fk > clogn. 
k=1 


60 Paul Erdés 


If true, this would be best possible. I certainly offer $500 to settle this 
annoying problem. 

Next I would like to make some remarks about probabilistic number 
theory (cf. the excellent books of Elliott [5, 6]). First, though, I have to discuss 
the preprobabilistic era (i.e., when I did not know probability). Denote by 
a(n) the sum of the divisors of n. If o(n) = 2n then n is perfect, if o(n) > 2n 
then n is abundant, and otherwise n is deficient. Bessel-Hagen stated in his 
book that it did not seem to be known whether or not the density of the set of 
abundant numbers existed. Behrend, Chowla and Davenport (using Fourier 
analysis) proved that the density did exist. I independently gave a different 
proof, using elementary methods. 

A number n is called primitive abundant ifn is abundant but every proper 
divisor of n is deficient. Denote by A(z) the number of primitive abundant 
numbers less than x. I proved 

x 2 


7a < A(x) < 


exp C1 (log x log log x) exp C2(log x log log x)!/2° 


Ivic simplified the proof and obtained better constants but 
A(x) = 


is still open. Michael Avidon, who is a student of Pomerance, found a better 
bound. The sum of the reciprocals of the primitive abundant numbers is 
therefore convergent and consequently the density of the abundant numbers 
exists. Later I proved that the distribution function of o(n) is purely singular. 

Denote by v(n) the number of prime factors of n. In 1917, Hardy and 
Ramanujan proved that for almost all n, 


£ 
exp(1 + 0(1))c(log log log x)1/2 


|v(n) — log log n| < f(n)(log log n)i/2 
for any f(n) — oo. Turdn found a very simple proof of this and in 1935, 
I proved that the density of integers n for which v(n) > loglogn is 1/2. Also 
I proved that the density of integers for which a(n) > a(n +1) is 1/2. I used 
Brun’s method and the Central Limit Theorem (which I did not know at the 
time) in the binomial case. This was easy without using probability theory. 
Now suppose f(n) is an additive function. I proved that the distribution 
function of f(n) exists provided the following three series converge: 


EL = 28, ye 
ina? ist Pipa P 
This is analogous to the “three series” theorem of Kolmogoroff, which of 
course I did not know. I conjectured that the convergence of the three series 
was both necessary and sufficient for the existence of the distribution function 
but this I could not prove because of gaps in my knowledge. In March of 
1939, Kac gave a talk at the Institute for Advanced Study in Princeton. 
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He stated the following conjecture: Let f(n) be an additive function for which 
|f(p)| < 1, f(p*) = f(p) (this is assumed only for the sake of simplicity), and 


ae, @, oP. (c) ov. 


Then the density of integers n for which 
f(n) < A(n) + e\/2B(n) 


is 


i exp(—a”)dx. 


He further stated that he could prove it if f(n) = )/,),, f(p) is replaced by 


f(r) = Vo pin f(p). I realized that if in fact Kac could prove this, then 
P<Pr 
by using Brun’s method, I could prove his conjecture. After the lecture 


we got together and soon saw that by combining our knowledge (i.e., the 
Central Limit Theorem and Brun’s method) we could indeed prove the 
conjecture. Thus, we would say with a little impudence that probabilistic 
number theory was born. Using our theorem with Kac, Wintner and I proved 
that the convergence of the three series mentioned really is both necessary 
and sufficient for the existence of the distribution function. Of course, many 
nice problems remain open in this field. I refer the reader to the book of 
Elliott. 

Davenport and I proved in 1935 that for any integer sequence 1 < a, < 
az <---, the sequence of multiples always has a logarithmic density which 
equals its lower density. Suppose a, < a2 <--- is an integer sequence with 
positive upper logarithmic density. Davenport and I proved that there i an 
infinite subsequence where aj, divides a;,,,. I also proved that if >; ahen 
is large then the sequence cannot be primitive, i.e., a; { a; cannot hold. 


Now the following problem is annoying: Let 1 < ay < ag <-:- bea 
sequence of real numbers and assume that for all i #4 7,k we have 


Note that if the a;’s are integers then (20) implies that no a; divides any a;, 
i # j. Does (20) imply 


1 il 1 
SS =e or So — 70 as © oo? 
7 log a; loga 4 Oy 


I offer $500 for settling this annoying diophantine problem. 

Let €, — 0 arbitrarily slowly. Then the density of integers which have 
a divisor in (n,n'*®") tends to 0 as n — oo. This is a best possible 
strengthening of a result of Besicovitch. I conjectured infinitely long ago that 
the density of integers which have two divisors d; and dz with d, < dz < 2d, 
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exists and is 1. This, and much more was finally proved by Maier and 
Tenenbaum (for which they collected $250). For more results in this subject, 
see the books of Halberstam-Roth [17] and Hall-Tenenbaum [18]. 

Let me close this section with a few problems about (my old friends) 
the primes. In fact, my very first paper was on a new proof of Chebyshev’s 
theorem: “Chebyshev said it and I'll say it again; there is always a prime 
between n and 2n”. Kalmar and I independently found in 1939 a very simple 
proof that [],<,p < 4", and I found a simple proof that ae = o, but 
perhaps these are trifles. 

Set dn = Pn41—Pn (where, as usual, p, denotes the nth prime). I proved 
in 1934 that for infinitely many n we have 


d(n) > 


and in 1938, Rankin added a factor of loglogloglogn to the numerator. I 
used to offer $10,000 for a proof that 


clog n log log n log log log log n 


clogn log logn 
(log log log n)? 


dn > 
(log log log n)? 
holds for every c and infinitely many n. However, I would now like to offer 
only $5,000 for this conjecture, and instead offer $10,000 for a proof that 


dn, > (logn)'** (21) 


for some € > 0. Of course, this would also imply the previous conjecture, so 
I suppose that (21) would actually cost me $15,000! 

Ricci and I proved that the set of limit points of d,,/log,, has positive 
measure. No doubt they are everywhere dense. Maier has the sharpest partial 
results and he has collected $250 more than once for some of these. Maier 
and I had dinner at Pomerance’s house one night and afterwards, Maier drove 
me to my hotel. I told Maier to stop at the library on the way so I could 
find my paper which would show that I owed him $250. Sure enough, we 
found the reference and I handed Maier the well-deserved $250. The next 
day Pomerance said jokingly that this was certainly an expensive taxi ride. 

In 1937, Kalmar and I proved that for every € > 0, there is an elementary 
proof for 

(1—e)—— < a(n) < (1 +e), 
Inn Inn 


n> no(€). (22) 
Our proof did not give an elementary proof of the prime number theorem 
since it was based on the following fact. Let 6 = d(€) be small but fixed. 
Then one can find a k so that for every k < t < k?, 
t 
S/ un) < 6t (23) 


n=1 
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where yz is the usual Mobius function. That such a k exists follows from the 
prime number theorem but (23) can be shown by a finite computation. This 
was a curious logical situation which was perhaps folklore but surely deserved 
publication. 

This is what happened. In the Spring of 1939 I met Rosser at a meeting 
at Duke University and I told him of our result. Rosser in fact already 
had a manuscript where he proved the analogous result for arithmetic 
progressions as well. Thus, Kalmar and I decided not to publish and agreed 
that Rosser could mention our result in his paper. However, Rosser’s paper 
never appeared. (It seems the referee, who was not a number theorist, had 
difficulties in reading it, and Rosser eventually lost interest). Diamond and I 
still felt that these results deserved publication and we tried to reconstruct 
our old proof with Kalmar. In any case, we found a proof which was perhaps 
not identical to the original and which appeared around 1980 in Enseignement 
Math. Incidentally, the Tauberian theorem of Ingham seems to imply that 
the prime number theorem should follow from the general asymptotic formula 
for n! but no one has ever found an elementary proof of Ingham’s Tauberian 
theorem. 


3. Polynomials 


Let me now turn to polynomials. Turan and I obtained many important 
results on the distribution of the roots of polynomials based on the asymptotic 
properties of the polynomials. I also obtained many inequalities about 
polynomials with Gallai (who in those days was called Griinwald), Offord and 
many others. As an example, I state an elementary result with Gallai (which 
somebody as a joke called it a generalization of a theorem of Archimedes): 
Let the polynomial f(a) have real roots f(—1) = f(1) = 0 and no other roots 
in [—1, 1], and suppose sup_j<,<, f(z) = 1. Then 


: f(a)de < 4/3 


with equality only for f(x) =1—2?. 

I have a long paper on polynomials with Herzog and Piranian in which we 
state many problems and results. Here I only want to mention one: For every 
€,, > 0 there is a polynomial f,(z) = []j_,(z — z),|zi| = 1, for which the 
measure of the set for which |f,,(z)| < 1 is less than €,,. It would be of some 
interest to determine the dependence of €,, on n, e.g., perhaps €,, > 1/(logn)°. 

Offord and I proved that among all polynomials 


n 
See €, = +1, 
k=1 
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for all but 0(2”/(lognloglogn)!/?) polynomials the number of real roots is 
2 
= logn + O((log n)?/ log log n) 
T 


(this sharpened earlier results of Littlewood and Offord). 

Clarkson and I (and independently, Laurent Schwartz) proved that if 
Yo, 1/nk < co and f(x) is a continuous function on (—1,1) which can be 
approximated by polynomials gn4(%) = ee a;z™ then f(x) is analytic in 
the unit circle. A very well known theorem of Miintz and Szdsz asserts that 
if }0,,1/nz = oo then every continuous function can be approximated by 
polynomials g,(x) = }>,a;2"', and that >71/n;, = oo is necessary for this, 

k 


as well. Our result makes this result clearer. 

To end this section I would like to mention an old result on polynomials 
which was later followed up by a number of mathematicians. Let f,(a) be 
a polynomial of degree n with only real roots, none in (—1,1), and with 
|f(a)| < 1 for -1<a<1. Then 


sup l(a) < &. 
—l1l<a<1 
Here, $ is best possible. If we take —1+¢ < « < 1—c then we get |f’(x)| < 
4+, Vn and we only have to assume that f(x) has no roots in the interior of 
the unit circle. 


4. Combinatorics 


One of my very favorite problems here is the following old conjecture of Faber, 
Lovasz and myself: Let G1,...,Gy be n edge-disjoint complete graphs on n 
vertices. We conjectured more than 20 years ago that the chromatic number 
of Ui, G is n. I offer $500 for a proof or disproof. Not long ago Kahn proved 
that the chromatic number of (Ji_, G; is at most (1 + 0(1))n. I immediately 
gave him a consolation prize of $100. It might be of interest to determine the 
maximum of the chromatic number of Cs G, if we require that G;(|G;, 
i # j, is triangle-free, or should have size at most 1, but it is not clear we get 
a nice answer in these cases. 

A family of sets A;, i = 1,2,..., is called a strong A-system if all the 
intersections A;(]A,;, i # j, are identical. The family is called a weak A- 
system if we only assume that all the sizes |A;()A,|, 1 # j, are the same. 
Rado and I [9, 10] investigated the following question: Denote by f(n,k) 
the smallest integer for which every family of sets Aj, 1 <i < f(n,k), with 
|A;| = 7 for all 7 contains k sets which form a strong A-system. In particular, 
we proved 


2” < f(n,3) < 2”n! 
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Rado and I conjectured that 

f(n,3) < cf (24) 
for some constant cz. No doubt, it is true that 

f(n,k) < . 


I offer $1,000 for a proof or disproof of (24). Recently, Kostochka proved (see 
his article in this volume) 


cloglogn vr 


Ss 
Ce) <r (oe 


I gave Kostochka a consolation prize of $100. More recently, Axenovich, 
Fonder-Flass and Kostochka improved this to 


f(m,3) < (nl) V9 


for every € > 0 provided n > no(e). 

Let f(n) > oo arbitrarily slowly. Is it true that there is a graph G of 
infinite chromatic number such that for every n, every subgraph of G of n 
vertices can be made bipartite by the omission of at most f(n) edges? I offer 
$250 for a proof or disproof. It would be of interest to prove or disprove the 
existence of a graph G of infinite chromatic number for which f(n) = o(n*) 
or f(n) = o((logn)’). 

Many years ago I asked: Is there a sequence A of density 0 for which 
there is a constant c(A) so that for n > no(A), every G(n, c(A)n) contains 
a cycle whose length is in A? (Here, G(n,e) denotes a graph with n vertices 
and e edges). This question seems very interesting to me and I certainly offer 
$100 for an answer. I am almost certain that if A is the sequence of powers 
of 2 then no such constant exists. What if A is the sequence of squares? I 
have no guess. Let f(n) be the smallest integer for which every G(n, f(n)) 
contains a cycle of length 2” for some k. I think that f(n)/n — oo but that 
f(n) < n(logn)* for some c > 0. 

Next, I would like to discuss some problems connected to Ramsey’s 
theorem. Denote by rl (pi,---,;Pe) = n the smallest integer so that if you 
color the k-tuples of |S| = n by @ colors, there will always be for some 
i a subset of S of size p; all of whose k-tuples have color i. (Often, we 
will omit writing the superscript (@).) The existence 7 (p1,---,;Pe) was 
first proved by Ramsey. It will be convenient to use the arrow notation 


introduced by Rado: n —> (pi,... po)? means that 7 (p1,---,Pe) <n. 


Of course, n + (p1,..-,pe)¥” means that rO (py... Dek > n. Also, we 
will use the square bracket notation introduced by Hajnal, Rado and myself: 
n— [pi,.--, pel? means that if we color the k-tuples of a set S' by @ colors, 
there always is for at least one 7, a subset S; of S of size p;, no k-tuple of 
which is colored with color 7. If all the p; are equal to the same p, we will 


simply write n > (p) and n + pl”. 
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As has been written elsewhere (see [11]), Ramsey’s theorem was rediscov- 
ered in 1933 by Szekeres. He and I proved 


2n — 2 
en2”!? <ro(n,n) < ( . ) (25) 


n-1 


or in the arrow notation 


2n — 2 
( i: ) — (n)z and cn2”/? + (n)2. 


n—-1 


In other words, if one 2-colors the edges of a complete graph on eas 


vertices, there is always a monochromatic complete subgraph K(n) on n 
vertices. 

Denote by f(n) the smallest integer for which f(n) > (n)3 holds, so that 
f(n) = 1r3 (n,n). I offer $100 for a proof that lim, +o. f(n)!/” exists, and $250 
for the value c of this limit. It follows from (25) that J/2<c<A. Perhaps c= 
2? Very little progress has been made in resolving these questions. Spencer has 
improved the constant in (25), and Thomason showed f(n) < (°"—7) /n4/2-*. 
My proof of the lower bound of (25) is nonconstructive. I offer $100 for a 
constructive proof that f(n) > (J+e)". Frankl and Wilson have a constructive 


proof that f(n) > n°!°S”. It is now known that 

2 2 

cin con 

2 8n) eS 
logn 


logn- 

The upper bound is due to Ajtai, Komlés and Szemerédi. The recent lower 
bound was proved by J. H. Kim using very clever probability arguments. It 
would be nice to have an asymptotic formula for r2(3, 7). I used to think that 
the probability method would give 


r2(4,n) > n?-* 


and, in fact, more generally, 


nk-l 


(log n)? 
for fixed k as n — oo. It now seems that I am wrong and new ideas will 


be required. The current record for a lower bound of r2(4,n) is en®/? due to 
Spencer. The proof of Ajtai, Komlés and Szemerédi gives 


ro(k,n) > 


ro(é,n) < en*~!/logn 
for fixed &. 
For more general Ramsey numbers, much less is known. Hajnal, Rado 
and I proved 
ger” < r3(n,n) < 27". (26) 


We believe the upper bound is closer to the truth, although Hajnal and I 
have a result which seems to favor the lower bound. We proved that if we 
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color the triples of a set of n elements by two colors, there is always a set 
of size s = |(logn)!/?| on which the distribution is unbalanced, i.e., one of 
the colors contains at least (5 + €)(3) triples. This is in strong contrast to 
the case k = 2, where it is possible to 2-color the pairs of an n-set so that in 
every set of size f(n)logn, where f(n) — co, both colors get asymptotically 
the same number of pairs. We would begin to doubt seriously that the upper 
bound in (26) is correct if we could prove that in any 2-coloring of the triples 
of an n-set, some set of size s = |(logn)*| for which at least (1—1)(3) triples 
have the same color. However, at the moment we can prove nothing like this. 
Hajnal proved 


r3(n,n,n,n) > 2°" 


which very strongly favors the upper bound in (26). 

I now turn to an old conjecture of Graham and myself which lies at the 
interface of Ramsey theory and number theory. Is it true that no matter how 
one k-colors the integers > 2 one can always find a solution to 


1 
i= S- —, for a finite monochromatic subset A? (27) 
acA 


We could never prove this even for k = 2. If the answer is in the affirmative, 
then determine or estimate the smallest integer f(&) for which any k-coloring 
of {2,3,..., f(k)} has the desired property. One can conjecture that if the 
integers are k-colored then for one of the color classes, every positive rational 
can be represented as a finite sum of the 7 <4 4, In fact, let 1 <<a, <---< 
Gp <n be a sequence of integers satisfying y + > f(n). Is it true that if 


f(n) 
(log log n)? 
then there is always a subsequence of the a;’s the sum of whose reciprocals 
sum to 1. The strongest conjecture we could not disprove states there is an 
absolute constant c so that if 


S- . > (c+ €)(log log n)? 


i 
ay<n 


then (27) has a solution among the a,’s, but if c+ € is replaced by c—e then 
this no longer holds. Perhaps all this is a bit too optimistic, but we do believe 


that if 
1 
» — >clogn 
ay 
ay<n 
then (27) has a solution in the a;’s, which if true, would show that our problem 
is a “Turdn”-type problem. 
I have worked quite a lot on problems in extremal graph theory. Since 
there are several excellent sources for such problems (the book of Bollobdés 
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[1] and the survey paper of Simonovits [21]). I will restrict myself to just a 
few nice problems here. 

Let H be a graph. The Turdn number T,,() of H is the smallest integer 
€n for which every G(n,e,) contains H as a subgraph. Simonovits and 
I conjectured long ago that if H is bipartite and every induced subgraph 
of H has a vertex of degree < r, then 


TH) < en? V2, (28) 


This conjecture is open even for r = 3. We further conjecture that if H has 
an induced subgraph, every vertex of which has degree > r, then 


Ty (H) > n2te-V(r-1) 


I offer $500 for a proof or disproof of each of our conjectures. 
Denote by f,(H) the number of graphs on n vertices which do not contain 
H as asubgraph. Clearly f,(H) > 27(H). I conjectured 


fall) < 20 One), (29) 


This is open even for H = Cy. It is well known that T;,(C4) = (4 +0(1))n3/?. 
On the other hand Kleitman and Winston only proved 


fn(Ca) < 200°. 


Simonovits and I also conjectured that every G(n,T,(H)) contains at least 
two copies of H. This is also open for C4. 


5. Geometry 


I want to conclude this paper with some problems in geometry. Since there are 
several excellent sources for such problems (the book by Croft, Falconer and 
Guy [4] and the survey article of Erdés-Purdy [13]), I will limit my remarks. 

Let 21,...,%, be n distinct points in the plane. Denote by f(n) the 
minimum possible number of distinct distances d(x;,;). Thus, any set of n 
points in the plane determines at least f(n) distinct distances. I conjectured 
[7] in 1946 that 


f(n) > cn/Vlogn. (30) 


The lattice points show that (30) if true is best possible. I offer $500 for a 
proof or disproof of (30). 

Denote by g(n) the maximum number of times the same distance can 
occur, i.e., g(n) is the maximum number of pairs for which d(a#;,2;) = 1, 
where the maximum is taken over all configurations x1,...,%,. I conjectured 
in my 1946 paper that 


g(n) < nite/ log log n (31) 
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for some c > 0. The lattice points again show that (31) if true is best possible. 
I offer $500 for a proof or disproof of (31). The best results so far are f(n) > 
n3/4 and g(n) < n°/4 +¢ for any € > 0 and n > no(e). 

Let 21,...,2, be n distinct points in the plane. Let h(n) be the largest 
integer so that for every x; there are > h(n) points equidistant from 2;. Is 
it true that for n > no(€) we have h(n) = o(n*)? I offer $500 for a proof but 
only $100 for a counterexample. 

Szemerédi conjectured that if 71,...,2, are n points in the plane with 
no three on a line then they determine at least |n/2]| distinct distances (but 
he could only prove n/3). 

Let 21,...,2, be a convex polygon in the plane. Consider the a 
distances d(2j, x;) and assume that the distance u; occurs s; times. Clearly 


E-(() 


I conjectured and Fishburn proved that 
» s <n. 
i 


I also conjectured that 5°, s? is maximal for the regular n-gon for n > no. If 
convexity is not assumed then I conjectured that 


S- sen (32) 


for any € > 0 and n > no(e). I offer $500 for a proof or disproof of (32). 

An old conjecture of mine states that if z1,...,2 are n points with no 
five points on a line then the number of lines containing four of the points is 
o(n?). I offer $100 for a proof or disproof. An example of Griinbaum shows 
that the number of these lines can be > cn°/? for some c > 0, and perhaps 
n®/? is the correct upper bound. Sylvester observed that one can give n points 
in the plane so that the number of points passing through exactly three of 
our points is as large as ie — cn for some constant c > 0. 

Purdy and I considered the following related problem. If we no longer 
insist that no five of the x; can be on a line then the lattice points in the 
plane show that we can get cn? distinct lines each containing four (or more) 
of our points, and in fact, c;n? containing exactly four of them. Denote by 
f(n) the maximum number of distinct lines which pass through at least four 
of our points. Determine or estimate f(n). Perhaps if there are cn? distinct 
lines each containing more than three points, then there is an h(n) — oo so 
that there is a line containing h(n) distinct points. We can not even prove 
h(n) > 5 but we suspect h(n) + oo, and perhaps h(n) > en!/? for some 
€ > 0. It is easy to see that h(n) < cn'/? for some c > 0. 

Finally, let me state the Erdés-Klein-Szekeres problem. This has quite a 
nice history which can be found in [11]. Let f(n) be the smallest integer r 
so that any configuration of r points in the plane with no three on a line 
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must contain the vertices of a convex n-gon. It is known that f(4) = 5 and 
f(5) = 9. Szekeres conjectured that f(n) = 2"~? +1. It is known that 


2n —4 
-AzLve<e & 1 
+ < sin) < ("> )) 4 


and these bounds have remained unchanged for many years. I would certainly 
pay $500 for a proof of Szekeres’ conjecture. 
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1. Dedication 


This paper has no connection with the two papers jointly authored by Paul 
Erdés and myself; nor does it overlap any of the many conversations we had. 
But I feel it is appropriate to dedicate the paper to him. It has the flavor of the 
mathematics we both particularly enjoyed: very explicit problems challenging 
us to answer “yes” or “no”. 

Let me take the occasion to express my admiration to Paul for his style, 
his enthusiasm, and his incredible ability to do something effective on just 
about any problem posed to him. I shall mention just one sample: the so-called 
Erdés-Kaplansky lemma which appears on page 67 of Jacobson’s Structure of 
Rings. This came about when I mentioned to him the then unsolved problem 
of determining the dimension of the full dual of an infinite dimensional vector 
space over a division ring D. The key point needed is to exhibit as many 
linearly independent sequences as the cardinal number of D. When D is 
commutative, Vandermonde does the trick, but a different idea is needed 
to cope with noncommutativity. He listened politely, but of course we all 
(or almost all) do that. But within twenty-four hours he did much more: 
he returned with a novel workable idea. Needless to say, Paul is equally at 
home in the finite or infinite, and this was a dandy example of a delicate 
Zornification. (Incidentally, although it is very late in the day, I take this 
publication as an opportunity to publicly ask Jake to delete my name in a 
future printing. The person who merely asked the question should not have 
his or her name attached to a theorem.) 

To conclude this dedication I mention a theorem which I learned from 
him (along with a slick proof): any countably infinite set has continuum 
many infinite subsets such that any two have finite intersection. There have 
been two occasions where this was just what I needed: on page 41 of Linear 
Algebra and Geometry—A Second Course, and in the paper Representations 
of separable algebras in volume 19 of Duke. If I had never met Paul, two 
problems might still be awaiting solution. 

Happy birthday, Paul! 
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2. Introduction 


In a recent paper [1] Arno settled two questions that go back to Gauss: 
he showed that the classical list of 54 imaginary quadratic fields with class 
number 4 is complete, and deduced the completeness of the following well 
known list of 33 numbers not divisible by 4 which are uniquely expressible as 
a sum of three squares: 


1 2, 3, 5, 6 410, 11, 13, 14, 19, 21, 
22, 30, 35, 37, 42, 43, 46, 58, 67, 70, 78, (1) 
91, 93, 115, 133, 142, 163, 190, 235, 253, 403, 427. 


Remark 1. (a) Through some mysterious slip the number 19 disappeared 
from Arno’s list. 
(b) Until Sect. 7 uniqueness is always to be taken in the simple minded sense, 
ignoring order and signs. 
(c) To get all integers which are uniquely expressible as a sum of three 
squares, take all products of the numbers in (1) by powers of 4. 


It is timely to seek similar results for ternary forms other than x? + y? + 
2”. Now there is substantial literature concerning formulas for the number 
of representations of integers by ternary forms. The relation between this 
literature and what I do here will be discussed in Sect. 7. In the body of the 
paper, in an elementary and self-contained way, I deduce from Arno’s theorem 
results on three ternary forms. The final section explores three other forms. 

The three selected forms are x? +y?+227, x?+2y?+2z?, and a?+42y?+42?. 
I chose these because in Dickson’s book [4, pp. 96-97] they are discussed right 
after sums of three squares and used later in the book in treating Waring’s 
problem for cubes. So this is my homage to Dickson, who introduced me to 
integral quadratic forms in a course during the summer of 1938, possibly the 
last time he presented his famous elementary course on number theory. 

In stating Theorems 1-3 three lists extracted from (1) play a role. The 
first consists of the even numbers in (1): 


2,6, 10, 14, 22, 30, 42, 46, 58, 70, 78, 142, 190. 2) 
For the second list divide the entries of (2) by 2: 
1,3,5, 7, 11, 15, 21, 23, 29, 35, 39, 71, 95. 3) 


The last consists of the entries in (1) which are congruent to 1 (mod 4): 


1,5, 13, 21, 37, 93, 133, 253. 4) 


Theorem 1. The numbers uniquely represented by x? + y? + 2z? consist of 
(2) and 4°6. (k= 0,1,2....,). 


Theorem 2. The numbers uniquely represented by x? + 2y? +42? consist of 
(2), (4), and 4°3 (k= 0,1, 2....). 


Integers Uniquely Represented by Certain Ternary Forms 73 


Theorem 3. The numbers uniquely represented by x? + 2y? + 4z? consist of 
(3) and four even numbers: 2, 10, 26, 74. 


Note that in Theorem 3 the list is finite. 

Although the information is not needed in this paper, it is probably 
helpful for the reader to have available the integers not represented by the 
four forms under discussion. 


ety? +27 and 2% +2y?74+227: 4*(8n4+7), 


ety? +222 and 27+2y72+42?: 4*(16n+4+ 14). 


3. Four One to One Correspondences 


In this section we establish various equalities between numbers of representa- 
tions. In each case a map will be defined, followed by a display of its inverse. 
In the first instance the routine verification that the product both ways is 
the identity will be presented; after that the details will be left to the reader. 

Let A be a given odd integer. We define three numbers NV, P, Q as follows: 
N is the number of representations of 2A by x? + y? + z?, P the number of 
representations of A by x? + y? + 227, and Q the number of representations 
of A by x2 + y? +42”. 


Lemma 1. N= P=Q. 


Proof of N = P. Given a representation A = u? + v? + 2w? we pass to 2A = 
(u+v)?+(u—v)?+(2w)?. For the reverse map, suppose that 2A = r?+s7+1? 
is given. Note that two of r,s,t must be odd and the third even. Say ¢ is 
even. We pass to A = [(r + s)/2]? + [(r — s)/2]? + 2(t/2)?. After u,v, ww > 
u+v,|u —v|,2w, to perform the second map we note that u and v have 
opposite parities, so that u+v and |u — v| are odd. Thus the second map 
sends u+v, |u—v|, 2w back into u, v, w. For the product the other way, 
r,s,t > (r+s)/2,|r — s|/2,t/2 and then back to 1, s, t. 


Proof of P = Q. Given a representation A = u? + 2u? + 4w?, we pass to 
A = u? + 2vu? + (2w)?. For the reverse, given A = r? + s? + 2t? we note 
that r and s have opposite parities. Assume that s is even and pass to A = 
r? + 4(s/2)? + 2t?. 


Lemma 2. Let B be any integer. Then the number of representations of B 
by x7 +y? +227 equals the number of representations of 2B by x? +2y?+2z?. 


Proof. Given B = u? + v? + 2w? we pass to 2B = 2u? + 2u? + (2w)?. For 
the reverse, given 2B = r? + 2s? + 2t? we note that r is even and pass to 
B= 2(r/2)? +87 +77. 
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Lemma 3. Assume C=1 (mod 4). Then the number of representations of 
C by x2 +y?+2 equals the number of representations of C by x? +2y? +227. 


Proof. From C = u? + 2v? + 2w? we pass to C = u2+(u+w)?4+ (v—w)?. 
For the reverse, write C = r? + s? +t? and note that C = 1 (mod 4) implies 
that two of r,s,t are even and the third odd. Say r and s are even. We pass 
to C = 2[(r + s)/2]? + 2[(r — S)/2)? + #. 


A. Partial Results Obtainable Without Reference 
to Class Numbers 


The next lemma will be used in proving Lemma 5. 


Lemma 4. In a representation by x? + y? + 22? of a number divisible by 8, 
x,y, and z must be even. 


Proof. We have 
EDae ty 42, (5) 


Necessarily x and y have the same parity. Suppose that they are odd. Then, 
since the square of any odd number is congruent to 1 (mod 8), we have x? + 
y” = 2 (mod 8). The element 2z? is congruent to 2 or 0 (mod 8), according 
as z is odd or even. In either case (5) is contradicted. Therefore x and y are 
even and it follows that z is even. 


Lemmas 5 and 6 are of course portions of Theorems 1 and 2. They are 
presented at this point to emphasize that their proofs do not require class 
number information. All that is needed is the following sharpening of the 
three square theorem, first published by Legendre in 1798: if a number is not 
divisible by 4 and not congruent to 7 (mod 8) then it has a representation 
as x? + y? + z? with the greatest common divisor of x, y, and z equal to 1. 
This quickly implies that the entries in (1) are square free. See [2, p. 304] for 
more details. 

In short: Lemmas 5 and 6 could have been proved 200 years ago. 


Lemma 5. The even numbers uniquely represented by x? + y? + 2z? are the 
multiples of 6 by powers of 4. 


Proof. Let 2F be uniquely represented by x? +y?+2z?. Note in the first place 
that F' is a sum of three squares. This follows from the form of the numbers 
represented by x? + y? + z? and x? + y? + 227, as exhibited at the end of 
Sect. 2. But we can see this at once directly: if 2F = u? + v? + 2w?, u and v 
must have the same parity and we deduce F' = [(u+v)/2]? + [(u—v) /2]?+w?. 
A similar remark is needed in Lemma 6 and will be left to the reader. 

Write F = r?457+t?. Then 2F = (r+s)?+(r—s)?+2t?. We can permute 
r, s,and t and sor and s are eligible to play the role of t. We get more than 
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one representation of 2F by x? + y? + 2z? unless r = s = t = G, say. Thus 
F = 3G’. Furthermore, this argument is applicable to any expression of F as 
a sum of three squares and so F = G? + G? + G? is the only representation 
of F a sum of three squares. By the remarks preceding the statement of 
Lemma 5, we see that F’ is a multiple of 3 by a power of 4, and so 2F has 
the form 4*6. We still have to verify that these numbers 46 are uniquely 
represented by x? + y? + 2z7. The number 6 is checked by inspection, and 
Lemma 4 looks after the others. 


Lemma 6. Assume that H is congruent to 3 (mod 4) and that H is uniquely 
represented by x? + 2y? +227, then H = 3. 


Proof. The argument is similar. We observe that H is a sum of three squares. 
When we write H = r? + s? + ¢? all three are odd. With any choice for r we 
can write 

H =r? + 2[(s + t)/2]? + 2[(s — t)/2]?. 


Therefore r = s = t = J, say. Moreover H = J? + J? + J? is the only 
representation of H as a sum of three squares. It follows that H must be 3. 


This section concludes with another preliminary lemma. 


Lemma 7. Suppose that 2K is uniquely represented by x7 +2y?+4z?. Then 
K is uniquely represented by x? +2y?4+2z?, and in this unique representation 
yz. 


Proof. Suppose that 
K =? + Qu? + 2Qw? =r? + 25? + 227. 
Then 
2K = Qu? + (2v)? + 4w? = Qu? + 4v? + (2w)? 


= 2r? + (2s)? + 4? = Qr? + 4s” + (22)?. 


The assumed uniqueness shows that u=r and that v=w=s=t. 


5. Proofs of Theorems 1-3 


Proof of Theorem 1. Lemma 1 has looked after the odd numbers uniquely 
representable by x? + y? + 2z?, showing that they are given by (3). Lemma 5 
looks after the even numbers. 


Proof of Theorem 2. Lemma 2 takes care of even numbers, Lemma 4 ac- 
counts for the odd ones congruent to 1 (mod 4), and Lemma 6 asserts that 
for those congruent to 3 (mod 4) the only possibility is 3. 


76 Irving Kaplansky 


Proof of Theorem 8. Lemma 1 looks after odd numbers. For even numbers, 
Lemma 7 is ready. It calls for us to examine all the numbers in Theorem 2 and 
check whether in their unique representation by x? + 2y? +22? we have y = z. 
The survivors get doubled. The work is facilitated by initially discarding all 
numbers in Theorem 2 which are not represented by the binary form x? +4y?. 
It turns out that the survivors are 1, 5, 13, and 37; their doubles are 2, 10, 
26, and 74. 


6. The Literature 


As remarked in Sect.2, there is a substantial literature on the number 
of representations of integers by ternary forms. Here the representations are 
being counted in the inflated sense, meaning that both the order and the signs 
of the summands are taken into account. During the 1920s there was a flurry 
of work, including the papers [3, 5], and [9] by Bell, Jones, and Uspensky. 

(It gives me pleasure to cite a research paper by Eric Temple Bell; 
I had the privilege of meeting him during a 1950 visit to Caltech. Several 
generations of readers, including myself, have appreciated his expository and 
historical writing. Let me mention also his science fiction, written under 
the pseudonym John Taine. I specially recommend Green Fire; The Purple 
Sapphire and Quayle’s Invention are also excellent. The Iron Star is still in 
print. Constance Reid has informed me that her current project is a biography 
of Bell.) 

A fairly definitive result appears as Theorem 86 on page 194 of [6]. It 
applies to all positive definite ternary forms. As is to be expected, what is 
being counted is the total number of representations by all the forms in a 
genus. Separating out a single form in a genus is an extra enterprise; the 
papers [7] and [8] make a contribution to this. 

Let me cite a sample from [5], using the notation of that paper. Jones 
writes N(n) for the number of representations of n by x?+y?+z? and A(n) for 
the number of representations of n by x? + y? + 2z?. Then A(n) = N(2n)/3 
when n is odd; this is quite parallel to the N = P portion of Lemma 1, 
and leads quickly to Theorem 1 for odd numbers. The next statement is 
A(2n) = N(n). This has no counterpart in my paper; it is a typical example of 
the fact that counting all the representations usually leads to neater formulas. 
But if the target is unique representation in the simple minded sense then 
for even numbers we must argue as in Lemma 5. 

It is only for a handful of forms that we are ready to derive results on 
unique simple minded representations; for the others we have to wait for more 
information on class numbers. 
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7. Concluding Remarks 


(a) The form x? +2y?+3z?. This form is treated in [4, p. 101] by a reduction 
to x? + 2y? + 2z?. Nevertheless, this is one of the forms that is waiting 
for more information on class numbers. But I was curious and Noam 
Elkies generously took the time to program and run a computation up 
to 16,383(= 24 — 1). For odd numbers the resulting list was (6). 


1,5, 7, 13, 17, 23, 47, 55. (6) 


For even numbers the result is recorded in Theorem 5 below. This was 
sufficiently striking that I decided to return to the drawing board to 
see whether I could prove it now. It turned out that there was enough 
information available to derive the needed preliminary theorem on a 
case with two representations. 


Theorem 4. The even numbers with exactly two representations by x7 + 
y? +22? are 


2,4, 12, 22, 38, 44, 86, 134, 326 (7) 
and multiples of the entries in (7) by powers of 4. 
From this Theorem 5 was deducible. 


Theorem 5. The even numbers uniquely represented by x? + 2y? + 3z? are 
the odd powers of 2. 


Theorem 5 is an addition to the growing list of theorems suggested by a 
computer and then proved. 

The proofs of Theorems 4 and 5 follow the pattern of those of Theo- 
rems 1 —3, with appropriate minor changes. I am omitting them, offering 
them as exercises for the reader. 


(b) The forms x? + 2y? + 32? and x? + 2y? + 327. 


One way of measuring progress is by the size of the discriminant. The 
forms x? + y? + 27 and x? + y? + 22? are the only ones with discriminants 
1 and 2, respectively. There are two forms of discriminant 3: x? + y? + 32? 
and a? + 2y? + 2z?. However, the latter is so closely related to x? + 2y? +32? 
that it merits no attention at this time. So I regard x? + y? +32? as the next 
challenge. It, too, has to wait for class number information but right now it 
is another tempting target for computation. 

It is convenient to bring in x? + 3y? + 32? as well, because then we can 
ignore multiples of 3, which simply bounce us back and forth between these 
two forms. Elkies found (8) and (9) for the numbers prime to 3 upto 16,383 
uniquely represented by x? + y? +32? and 2? + 3y? +3z?, respectively. (Note 
that the entries in (9) have to be congruent to 1 (mod 3) to be eligible for 
any kind of representation.) To get the full answer for x? + y? + 32? adjoin 
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multiples of (8) by even powers of 3 and multiples of (9) by odd powers of 3. 
For the other form the parities are reversed. 


1, 2, 7% 10, 11, 14, 19, 22, 23, 26, 31, 34, 38, (gy 
46, 47, 55, 59, 70, 71, 86, 94, 115, 119, 154, 166. 


1, 10, 13, 22, 34, 37, 46, 58, 82, 85, 130, 142, 190, 253. (9) 


The lists (6), (8) and (9) could be compared with those obtainable by 
assuming the completeness of the existing lists of imaginary quadratic fields 
for class numbers bigger than 4. I have not undertaken this comparison as 
yet. 


(c) Numbers of the form 8n + 3 with two representations as a sum of 
three squares. These numbers need not wait for more class number 
information: they can be treated right now. The task is entirely 
straightforward. There is, however, one thing to note. It will not be 
the case here that the numbers are square free. This introduces a 
complication; the technique needed to cope with it is available in pages 
307-308 of [2]. 

(d) The form xy + xz + yz. Here is a different chapter in the subject: 
the representation of an integer n by xy + az + yz. This is a ternary 
quadratic form, but it is indefinite. If we allow x, y, and z to take both 
positive and negative values, then any n would have an infinite number 
of representations. The situation is remedied by a restriction to positive 
values. More precisely, we take n positive and allow z, y, z to be 0 as well 
as positive (note that in fact only one of the three can be 0). Then there is 
in this context again a connection between the number of representations 
and class numbers. Indeed the number of representations of n (in the 
inflated style where the order counts, but of course this time signs do 
not enter) is three times the number of equivalence classes of positive 
definite binary quadratic forms of discriminant n. To be honest, for this 
to work perfectly requires a little creative accounting: representations 
that use a zero are given weight 1/2, and the forms a(x? + y?) and 
a(x? + xy +y?) get weights 1/2 and 1/3, respectively. 


The idea goes back to Liouville. It was taken up again in the 1920s by 
Mordell and Bell. The note [10] by R. F. Whitehead presents a model concise 
proof in half a page. Starting with Whitehead’s note, a curious reader can 
trace the earlier references. 

Suppose we examine the number of representations in the simple minded 
style. It takes only a glance to see that the integers with exactly one 
representation are 1 and 2. Those with exactly two representations are 4, 
5, 10, 13, 22, 37, and 58. This could have been proved around 1970 as soon as 
Baker and Stark (separately) found the classical list of imaginary quadratic 
fields with class number 2 to be complete. Of course more can be done now 
that we are up to class number 4, but I leave that story for another day. 
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In closing I heartily thank Noam Elkies. In addition to the computations 
noted above, he ran others that confirmed the correctness (upto 16,383) of 
Theorems 1-4. 
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1. Introduction 


If you stand on the famous Chain bridge in Budapest, you will see below 
you the broad sweep of the Danube. But this broad river arose from the 
confluence of many small streams. Indeed, there is a point near St. Moritz, 
where if a rain drop happens to fall a few centimeters to the north, it will 
make its way into the Rhine, and so to the North Sea. If it falls a little 
to the west, it will join the Adda and the Po, and end up in the Adriatic, 
whereas to the east it would run into the Inn, the Danube, and the Black 
Sea. An apparently negligible movement at the start can make a difference 
of hundreds of kilometers later on. 

We can compare this to the flow of events in life. Each event has its 
own sequence of consequences. The consequences of different events will 
flow together, and evolve to and fro in an unpredictable manner like the 
sinuous bends of a river. But there is one important difference. A river is 
the union of its component streams. If one stream dries up, it will make 
only a trivial difference to the final flow of the river. But an event is the 
intersection of previous events, in the sense that if only one apparently small 
and unimportant component fails, it may make a very great difference to the 
final consequences. 


2. The Erd6és Conjecture 


Here we look at a typical conjecture of Erd6és’s, one which was apparently 
a conjecture of no special importance, and trace the flow of events resulting 
from it and its combination with other flows, to see how, very plausibly, it 
became of considerable importance to us all. 

Around 60 years ago Erdés was a young man in Cambridge. He suggested 
the conjectures 


(a) Any dissection of a square into a finite number of smaller squares must 
contain at least two squares of equal size, 

(b) At least two such equal squares must touch. (We take the word 
“dissection” to have its obvious meaning. To be strictly accurate, we 
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would say that the union of the component squares is the original square, 
and any two component squares intersect, if at all, in part or all of an 
edge. But, in what follows, we will try to present the discussion in a 
simple and obvious way, leaving matters of complete rigor to be supplied, 
if desired, by the reader.) 


This conjecture was noticed by W. R. Dean, then a Cambridge lecturer 
(later a London professor). He sometimes visited Christ’s Hospital (a famous 
boys’ school). On one such visit he mentioned the conjecture, conceivably 
thinking that it might be decided by some of the more capable mathematically 
inclined pupils. At least one of the pupils, Arthur H. Stone, took note of 
the conjecture. Soon afterwards, he gained a scholarship to Trinity College 
Cambridge. 


3. The Cambridge Students 


Among the other schoolboys sitting the scholarship examination was one, 
Cedric A. B. Smith, feeling very miserable. His great ambition was to study 
mathematics at Cambridge. But he had just failed one vital examination. 
And now he was faced with an impossibly difficult exam paper. The situation 
seemed desperate, and he was near to bursting into tears. But at that moment 
someone, rejoicing in a warm sunny summer’s day, walked past, cheerfully 
whistling the Mexican waltz, “Over the Waves.” Life once more seemed worth 
living—perhaps the exam paper was not totally impossible. And if, unlike 
Arthur Stone, Cedric Smith did not get a scholarship, at least he was admitted 
to Trinity, and his ambition was fulfilled. (Incidentally, both he and Arthur 
were told that their applied mathematics was much better than their pure, 
which explains how it comes about that Stone is now a leading topologist.) 

So, not long afterwards, Cedric Smith walked into his very first university 
lecture, on geometry, with great excitement. What new revelations were in 
store? He already knew much geometry—about angles, areas, lines, rectan- 
gles, triangles, circles, conics, poles and polars, tetrahedra, dodecahedra, and 
all that nonsense. But the lecturer mentioned none of that. He added points 
together, and multiplied points by noncommutative numbers. At the end, 
Smith said to the young man next to him, “That was very confusing.” The 
reply was, “Not at all. I thought it was a very good lecture. When is the next 
lecture?” —“At 11.”—“No it isn’t, it’s at 10.” 

So the two walked into a 10 o’clock lecture. They did not know that there 
was a typing error in the timetable, and that it was an advanced lecture. 
They did not know that, because the lecturer’s Russian accent was so thick 
that for 30min they did not understand one word. 

After the lecture, Smith found that his new friend was R. Leonard Brooks, 
the future discoverer of Brooks’s Theorem in Graph Theory (though that was 
not obvious at the time.) 
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Later on, when Mr. Besicovitch produced some weird mispronunciation 
of an English word, and the class roared with laughter, he sternly defended 
himself. “Feefty meelion peeple speek yore kind of Eenglesh. Fife handred 
meelion peeple speek my kind of Eengleesh.” 

Came the Christmas vacation. Smith went out shopping, and met 
someone he thought he had seen in lectures. They both stopped. The other 
young man said: 


“What are you doing here?”— “I live here.” 
There followed a long silence, then Smith asked: 

“What are you doing here?” —‘“T live here.” 
Another long silence, then the other said: 

“T must do the shopping.” 


and walked off. But when they got back to Cambridge, it turned out that 
the other was Arthur H. Stone, and that, like Brooks, he had a room in New 
Court. Smith introduced Stone to Brooks, while Brooks introduced us to a 
student of chemistry, William T. Tutte, also of New Court, saying that Tutte 
was good at chess. 

While we knew that we were real mathematicians, we were still broad 
minded enough to talk to someone who was only a chemist. Tutte put a 
problem to us: find a semipotential function, i.e., a function S(a) such that 


S(S(x)) = exp a. (1) 
We couldn’t. So Tutte said: let a be such that S(a) = a. Then 
exp a = S(S(a)) = S(a) =a. (2) 


For example, we might have a = 0.318 + 1.337%. Differentiate (1) repeatedly, 
and substitute a for x in each relation. We find the successive derivatives of 
the function S(x) at a, and hence its Taylor series. 

This looked plausible, though we weren’t then sure if it converged. Here 
is a sketch of a justification. Write Ing x for InInz, In3 x for InlnInz, and so 
on. If w is near a, the iterated sequence In, x converges geometrically to a, 
with asymptotic ratio In’ a = 1/2. Hence as n increases, the limit 


f(x) = lim[a" (In, x — a)] (3) 
exists, and satisfies f(exp x) = a- f(x), ie., 
exp x= f~'(a- f(2)). (4) 


So S(x), defined as f~+(./af(a)) satisfies (1), and its Taylor series can be 
found exactly as Tutte did. 
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Fig. 1 (a) Morori’s squared rectangle, with horizontal side 64 and vertical side 66. 
(b) An electrical network representing Morori’s rectangle, with total current 64 and 
potential drop 66 


4. Squaring a Square 


Stone then introduced us to the question of whether a square can be divided 
into unequal squares (though he didn’t then know that it came from Erdés.) 
We spent some 3 years working on the problem. 

The first advance was to realize it was easy to find rectangles divided 
into unequal squares, as in Fig. 1a. We drew a rough figure, something like 
Fig. la. Then the conditions that the interior squares fitted together gave a 
set of homogeneous linear equations. We solved these (using suitable short 
cuts). The result was always a rectangle of uniquely defined shape, although 
clearly it could be magnified in size by any constant factor. Unfortunately, 
the two sides were never equal, and never had any very simple ratio. (We 
later found that the rectangle shown in Fig. la had been already discovered 
by Moron [1925].) 

The second stage was to replace the rectangle by an electrical network of 
wires, all of unit resistance, as shown in Fig. 1b. 

Each horizontal line in the rectangle becomes a node in the network. 
Each square becomes a wire (edge) joining the two nodes corresponding to 
the horizontal lines between which the square lies. Each wire carries a current 
equal to the side of the corresponding square. Since the wires have unit 
resistance, this equals the potential drop along the wire. The linear relations 
stating that the squares fit together become Kirchhoft’s (1847) laws: the first 
law, that the total current entering any node must equal the total current 
leaving it, except for the “source” and “sink” nodes at the top and bottom 
respectively, and the second law that the total change in potential round a 
circuit must be zero. 

It follows that the total current is equal to the horizontal side of the 
corresponding rectangle, and the total potential drop between source and 
sink is equal to the vertical side. 
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We can, if we wish, “complete” the network by adding a “battery wire” 
joining the sink and source, containing a battery providing the electromotive 
force necessary to drive the currents through the network, as in Fig. 1b. 
Obviously, we can rotate the rectangle through a right angle, so that 
“horizontal” and “vertical” are interchanged. If we do this we get a new 
network. The relation of this to the former one is simply that the two 
completed networks are topological duals. 

We can again calculate the currents (= sides of component squares) by 
noting that Kirchhoff’s laws provide linear equations, and the solutions are 
given by determinants. But a more interesting way is to use spanning trees, 
following Kirchhoff [1847]. 

For the moment, consider a spanning tree in the network, one not 
including the battery wire, as in Fig. 2. Imagine that only the wires in this 
tree are now conductive: we could imagine the other wires cut. Let a unit 
current flow from source to sink, as in Fig. 2. It will follow a uniquely defined 
path. Repeat for every such spanning tree, and add the currents. We get the 
currents shown in Fig. 1b. 


source --~~ . 


Fig. 2 A unit current flowing through a spanning tree 


To show that in general this procedure gives the network currents, we 
show that Kirchhoff’s two laws hold. Clearly Kirchhoff’s first law holds for 
the current in each spanning tree, as in Fig. 2, and hence in the total. 

To verify Kirchhoff’s second law, we introduce a new construction. The 
trees already considered are those spanning trees in the completed network 
which do not include the battery edge. Now take a spanning tree which 
does include the battery edge, and again suppose that all edges not in this 
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tree are cut. Then the nodes will divide into two classes, those connected 
through the tree to the positive pole of the battery, and those connected to 
the negative pole. To nodes in the first set give potential 1, and to those in the 
second set potential 0. Repeat for all spanning trees containing the battery 
edge, and add the potentials. We get a set of potentials which necessarily 
obey Kirchhoff’s second law. It remains to show that the current in any wire, 
as derived by the first construction, is equal to the difference in potential 
between its end nodes, as derived by the second construction. But that is 
a consequence of the theorem, that if e; and eg are two edges in a graph, 
then a spanning tree including e; but not eg provides in the obvious way one 
including e2 but not e;. Take e; and e2 to be the battery wire and the wire 
in question. 

From these results it follows that the total current flowing through the 
network (= horizontal side of the corresponding rectangle) is equal to the 
number of spanning trees in the network which do not include the battery 
wire. We called this the “complexity” of the (uncompleted) network. The 
total potential drop (= the vertical side of the rectangle) equals the number 
of spanning trees in the completed network which do include the battery wire. 
But spanning trees in graphs are the same as the bases of the corresponding 
matroids. In fact we were dealing with regular patroids [Smith 1972, 1974]. 

A surprise occurred when Brooks cut up a squared rectangle into its 
component squares to form a jigsaw, and challenged his mother to put them 
together again to form a rectangle. She did so, but it was not the rectangle he 
had started with. The phenomenon was explained by Tutte. As an example, 
consider the electrical networks of Fig. 3a, b, corresponding to two rectangles 
made up of the same squares differently arranged. (Both have total current 
= horizontal side 1,025 and total potential drop = vertical side 592). (These 
are not the rectangles tried out on Brooks’s mother, but two new rectangles 
constructed by Tutte.) 

Tutte pointed out that the nodes P, Q, in Fig.3a are symmetrically 
related to A, B, and C, so they have equal potentials equal to the average of 
the potentials of A, B, and C. So one can pick Q up and put it down on top 
of P without changing currents, thus deriving Fig. 3b. 

Tutte now went on to change Fig. 3a-c, by reflecting the interior of the 
triangle in a horizontal mirror, leaving the external connections unaltered, 
obtaining in Fig.3c, d, another pair of rectangles composed of the same 
squares. 

Acknowledgment. Figure 3 was taken (with suitable modifications) from 
Figs. 31 and 32 of J. D. Skinner, Squared Squares: Who’s Who and What’s 
What. With kind permission from the author. 

Now a new surprise, all four networks had the same total currents, and 
total potential drops, so that all four corresponding rectangles had the same 
horizontal and vertical sides, though not all the component squares were 
shared. Tutte explained that as follows. The uncompleted networks 3a and 
3c are topologically isomorphic, hence contain the same number of spanning 
trees, and hence the same total currents. If we add battery wires, then any 


Did Erdés Save Western Civilization? 87 


Fig. 3 Four electrical networks, each representing a squared rectangle with hori- 
zontal side 1,025 (= total current) and vertical side 592 (= total potential drop). (a) 
and (b) correspond to rectangles composed of the same squares differently arranged, 
and so also do (c) and (d) 
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spanning tree including the battery wire and the wire BC in 3a gives rise to 
one including the battery wire and the wire A’B’ in 3c. So the numbers of 
spanning trees including the battery wire are the same in 3a and 3c, so the 
total potential drops are equal. 

Brooks, on the one hand, and Stone and Smith on the other, simultane- 
ously and independently carried these ideas further, and each succeeded in 
producing two rectangles of the same size and shape containing all different 
squares, with the exception that two corner squares were the same. By placing 
these two rectangles so that the equal corner squares were superimposed, they 
each succeeded in producing a square divided into unequal squares, by the 
construction shown in Fig. 4. 

Erdés’s conjecture was incorrect!! (Stone et al. 1940). (But Sprague 1939, 
beat us by a short head.) 


/ 7 


Fig. 4 A method of obtaining a square dissected into unequal squares from two 
squared rectangles having only one square in common, and that at a corner in each 
rectangle 


A number of other workers have since been interested in the problem, and 
the state of investigation up to date is described in Skinner [1993]. However, 
from the point of view of the present discussion, the important fact is that 
by this time that, notwithstanding that we knew that Tutte was a chemist, 
we were extremely impressed by his mathematical ability. 


5. Wartime Developments 


The year 1939 was rather like the present (1993), in that there was severe 
unemployment. The students of Trinity College, Cambridge, organized a 
camp in the Lake District to give some unemployed both a holiday, with 
excursions, and also an opportunity to do some work making a lakeside path. 
One of the participants was a Tutor (later professor) Patrick Duff. But one 
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day it rained hard, Patrick Duff’s tent collapsed, and he was taken into the 
big house to dry. Cedric Smith followed to talk to him. Patrick Duff said: 


“Do you know Bill Tutte?”— 
“Yes.” — 

“We’re very worried. He’s no good” — 
“He got a first class degree.” — 
“His supervisor is disappointed” — 
“Well, he’s very good at maths.” — 
“Prove it.” 


So Smith sent a letter to the College, detailing Tutte’s achievements. But 
Trinity sounded rather unimpressed. 

Quite soon war broke out. Stone had just arrived in Princeton with a 
Visiting Fellowship. He wrote back to the remaining three, saying that he 
had met a Hungarian named Erdo6s, who pronounced English in a Hungarian 
manner, so that “pineapple upside down cake” became “pinnayopp-play 
oopshiday dovn tsockay,” But that did not prevent them collaborating 
mathematically. 

Quite soon Tutte joined a group at Bletchley Park. What they did 
there was a profound secret. But the group contained some of Britain’s 
best mathematicians, and was strongly suspected to dealing with codes and 
ciphers. Why was Tutte asked to go to Bletchley, and not to a chemical 
establishment? Presumably through Smith’s letter, declaring him to be 
an excellent mathematician, even though previously the letter had a cool 
reception. 

For 3 years Smith was a porter at Addenbrooke’s Hospital, Cambridge. 
One day near the end of the war he was cycling past the hospital when he 
saw Prof. Duff, and waved. Prof. Duff shouted “Stop! STOP!! S T O PII!” 


“What’s the matter!!??” — 
“Do you know Bill Tutte’s address?” — 
“Yes, Why??” 


“We've been told to elect him to a Fellowship at Trinity. But they 
didn’t tell us why, or what his address is. So we can’t send a telegram 
to congratulate him.” 


The past may now be only a memory. But during the war the Germans 
had occupied most of western Europe, so that Britain had to get all vital 
supplies across the Atlantic, constantly menaced by German submarines. 
The situation was not at all good. It would have helped to be able to 
decode German messages. The Poles had secretly examined a German coding 
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machine as it was transported across the Polish Corridor, so the coding device 
was known. It was simple, but fiendishly clever, and was found very difficult to 
decode by even the top mathematicians at Bletchley. Very plausible rumor 
says that Tutte supplied the vital clue, possibly avoiding military defeat. 
The consequences of a Nazi victory would have been horrific—civilization 
was saved. And it all began with a conjecture by Erdés! 


6. Postwar Developments 


To round off this story, we may briefly say what eventually became of the 
various dramatis personae. W. R. Dean became Professor at University 
College London. Leonard Brooks became an Income Tax Inspector—but 
continued to develop mathematical ideas. Arthur Stone was another Fellow 
of Trinity, and after a time at Manchester, he and his wife Dorothy both 
became Professors of Mathematics at the University of Rochester, NY. Bill 
Tutte, now truly a mathematician, via Toronto became Professor at Waterloo, 
then Fellow of the Royal Society of Canada, Distinguished Professor, Fellow 
of the Royal Society of London, and Founding President of the Institute of 
Combinatorics and its Applications. Not bad for only a chemist! 

As for Cedric Smith, he heard that the Galton Laboratory needed a 
statistician. (The Galton lab is the human genetics section at University 
College London.) This seemed an interesting possibility for a first academic 
post. He applied, and was interviewed by Prof. Lionel Penrose, and by Prof. 
J. B. S. Haldane, who greeted him with “you know why you’ve come here. 
Prof. Penrose thinks there might be a job for you. But I don’t think so.” 
Nevertheless it did turn out to be a reasonable first job, for Smith is still 
there after 47 years. 

This gave him a good chance to get to know the famous Erdés in person. 
Thus, after he had got married, his mother-in-law complained of the phone 
ringing and a deep voice (guess who) saying, “How are you? Vair is your 
slave? Is he preaching?” (in conventional English = Where is your husband? 
Is he lecturing?). Said she, indignantly, “I haven’t got a slave.” 

The University of London ran a series of seminars. Erdés attended every 
One. That is to say, he sat there for the first 15 min, with a look on his face as 
if to say “I know all that.” Then he walked out. There came the time when 
Erdés himself was the speaker. Could he walk out on himself? It seemed 
improbable. We waited to see. After 15min he turned, faced the audience, 
said “Rado can explain all this better than I can.”, and sat down. 

Later there came an invitation from the Bolyai Mathematical Society 
to attend a combinatorial conference in Hungary, and so to see Erdos in his 
own town. Smith and Tutte went to the Hungarian Legation in London to get 
visas. “What can we do for you gentlemen?” asked the official— “We want to 
go to a mathematical conference.” — “We know all about you, gentlemen. We 
can’t give you visas.” — “Why not?”—‘“Permission has not yet come through 
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from Budapest.” —“Well,” said Smith, “if you don’t give me a visa now, I’m 
not going.” Anxious phone calls followed, then the official said, “You are very 
fortunate, gentlemen. Permission has just come through from Budapest.” 

Smith took the visa home, then looked at it. As he remembers, it said 
something like “Egy bemenet, semmi kimenet.” He phoned his father-in-law 
and asked, “Doesn’t that mean ‘one entry, no exit?’”—“Yes. What about 
it?”—“T would like to come back.”—“Well, if you’re worried, [’ll phone the 
Hungarian legation.” They replied, “We are not mathematicians. We are 
only lawyers. We say that if you go in you must come out. What’s worrying 
you?” 

So next day Smith found himself, feeling hungry, in a train crossing 
Belgium. He went to the dining car. Opposite him sat someone reading a 
German chemical treatise. Smith speaks little German, and stayed silent. 

However, when Smith held up a banknote to pay the bill the German 
leaned over and said “C’est trop.” Smith tried to say, in schoolboy French, 
that he only wanted change. The Frenchman asked, “Quelle est votre nation- 
alité?”, to which Smith replied “Anglais.” There was a long silence, which 
did not seem too polite, so Smith asked the Frenchman, “Quelle est votre 
nationalité?” —“Oh, ’m American. ‘What are you going to Frankfurt for?” — 
“[’m not going to Frankfurt.” “Then where are going?” —“To Budapest.” — 
“Budapest?”, said the American, “Budapest? that’s where I was born. 
What are you going to Budapest for?”—“A mathematical conference.” —“A 
mathematical conference”, said the Hungarian, “I know some mathemati- 
cians. Do you know a mathematician called Erdés?”—“Everyone knows 
Erdos.” —“His father was my mathematics schoolteacher.” —“He will be at 
the conference.”—“Give him my regards. Do you know a mathematician 
called Stone?”—“There are lots of mathematicians called Stone. But the 
only one I know went to America and married a mathematician.” — “Was her 
name Margaret?”—“No, her name is Dorothy.”—“Then that must be the 
Stone I know.”—“He will be at the conference.” “Give him my regards.” 

At Wien Smith found an old second class carriage, more decrepit than 
anything he ever remembered seeing, labeled “Orient express, Budapest, 
Bucuresti”, and drawn by an ancient steam engine. 

When it got to Hegyeshalom they put an electric locomotive onto the 
train, and it rushed through the darkness until it cam to a hill of lights, and 
a broad river. Itt a nagy szép hires fovaros Budapest. Ilyen boldogsdg!! But 
what to do next? 

Here he was in the home town of not only Erdés but also of very many 
other famous mathematicians. Dr. Suranyi, the Society Secretary met him 
and took him to his hotel. A group of mathematicians were there, possibly 
including Erdés. At the next table sat someone wearing a St. Johns College 
Cambridge tie. Smith nervously asked him “Don’t I know your tie?”—‘“No, 
you don’t. You think it’s a St. Johns tie. It isn’t. It’s my old school tie. The 
stripes are narrower.” Smith looked up. There was Gabriel Dirac, come for 
the conference, wearing a genuine St. Johns tie. The ties were compared. 
There was no visible difference. 
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The conference went wonderfully well, with Hungarian hospitality and 
organization. But that is a story for another occasion. 

Erdoés is not only one of a line of most distinguished mathematicians 
coming from Hungary. He must also be one of the most respected, and one 
treated with the greatest affection by all mathematicians. It is both a great 
pleasure and a great honor to greet him on his 80th birthday, “much love to 
Uncle Paul”, 


“Sok szeretettel a Pali Bacsikanak!!” 
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1. Encounters with Paul Erdés 


My first encounter with Paul Erdés was curiously indirect. As a pre- 
undergraduate at Cambridge (England) in 1934, I learned from one of 
the Trinity College tutors that a mathematician named Erdés, passing 
through Cambridge, had mentioned an intriguing conjecture (attributed to 
Lusin, I believe), implying that a square could not be dissected into a 
finite number of unequal smaller square pieces. I passed this problem on 
to three fellow students, and we eventually found methods that produced 
counterexamples [1]. Of recent years the advent of high-speed computing has 
given rise to a considerable industry listing large numbers of dissections of 
squares into unequal squares ([2] and [6] for example), an industry that could 
continue indefinitely as there are infinitely many different dissections of this 
kind. 

I first met Erdés a few years later (around 1940') at Princeton, where I 
was a graduate student. Nearly all the mathematicians there were friendly 
and approachable, but Erdés excelled them all in that he was always willing 
to listen, with attention and encouragement, to other people’s mathematics, 
even to lowly graduate students. He listened to my thesis results as they 
emerged, as he had listened to my wife’s (already completed) thesis. This, we 
found, was and is typical of him. 

When I next met Erdés we were at Purdue (1942?7), during World War II. 
His arrival there, for a stay of some months, began a little inauspiciously. 
Before setting out for the mathematics department, he had engaged a room 
and left his luggage there. At the mathematics department he found he 
needed something from his room, and Douglas Olds, of the Purdue faculty, 
volunteered to walk there with him. As Professor Olds told it, they walked 
and walked and walked, until he asked Erdés “Is it much further?” “Fascism 
Stalinism!” was the reply “I thought you were taking me there”. 

As this indicates, Paul was (and is) strongly interested in politics, with 
rather liberal views. He disapproved both of Communism as practiced in 


1 P. Erdés remarks: autumn 1939 
2 P. Erdés remarks: 1943 
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Joedom and of some of the American reactions to it. “Joe” and “Sam” often 
rated disapproving remarks. He also disapproved, in general, of Providence, 
which he would refer to as the “S. F.” (Supreme Fascist). At that time he 
espoused the cause of China—not then “Red China”—and used to raise quite 
a bit of money for this cause by volunteering to drink small quantities of 
“poison” (i.e., alcoholic beverages) for so many dollars for China. (Nowadays 
the U.S. custom for raising money for good causes is for volunteers to walk 
so many miles at so much per mile, paid for by other volunteers. This at least 
encourages a healthier lifestyle.) 

Naturally we continued to discuss mathematics, and wrote a couple of 
joint papers [3, 4]. Paul’s method for writing joint papers was, of course, for 
him to convey just the essence of his share of the argument; it was up to the 
co-author to write the actual paper. (I believe Alexandre Dumas pére used 
a somewhat similar system.) Unlike many mathematicians, he then showed 
little interest in music; but there was one record (a “78” of course) I had that 
he was very fond of—Myra Hess playing her piano arrangement “Jesu Joy of 
Man’s Desiring” from Bach’s cantata. It is a beautiful piece, and he would 
often ask to hear it. 

My later encounters with Paul Erdés have been somewhat hit-or-miss; 
for example, last Spring my wife and I left our home near Boston to attend 
a conference in Columbus at which he was expected to speak, but instead 
he gave a talk near Boston precisely during our absence. As someone said, 
the Heisenberg uncertainty principle applies to him; you cannot determine 
simultaneously his position and his velocity. We have sometimes successfully 
attended the same conference—for example a combinatorial one in Calgary, 
Canada (1969), resulting in [5], and one on “Real Analysis” in Smolenice 
(1991). And we have corresponded, the usual Erdés letter beginning “Dear 
Stone, Let p1, po,...,Pn be distinct primes” (or points or whatnot), and 
listing some interesting but (to me) hopelessly difficult conjectures. 

In addition to the foregoing traits, here are two more that even a casual 
acquaintance with Paul would reveal. First, he delighted in small children 
and infants. (They were the “superbosses” , in his terminology, outranking 
older females who were merely “bosses”.) Second, it may be observed that 
I have been a bit vague about some dates. But he would remember them 
exactly. He remembers events of interest to his friends, too; as someone said, 
he remembers the incidence matrix. 


2. Some (Very) Elementary Number Theory 


Theorem 1. The two Diophantine equations 


x = Am +1 


have at most two nontrivial solutions between them. 
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Here \ > 1 and m > 2 are given integers, and for “nontriviality” we 
require the unknown integers x and y to be greater than 1. 

It is convenient to refer to the equations as (*, ++) and (*, —) respectively, 
and to abbreviate “nontrivial solution” by “solution”. Thus the assertion is 
that (*,+) has at most 2 solutions. 


Proof. Write (*,+) as the pair of Pell equations x? = juz? +1 where, if y is 
odd, fp = Am and z = mY—Y/?, and if y is even, pp = Am? and z = m(Y-2)/2, 
In either case we have m | w (so w > 1) and z > 1. We can assume that pu is 
not a square, else there are no (nontrivial) solutions. 

If the equation z? = yz* — 1 has solutions, let (1, 21) be the smallest 
positive one. If not, let (#1, z1) be the smallest positive solution of x? = yz?+1 
(which certainly exists). In either case, note that (x1,m) = 1. It is well known 
(see e.g. [10]) that all (nontrivial) solutions of x? = yz? +1 are of the form 
(Ly, Z-) (r = 1,2,...) where 


1/2 


Bp + plz, = (ay + pz)", tp — zp = (a — 221)”. 


Thus z, can be expressed as the finite sum 


r r r 
Zp = (1) at tz t+ (5) a 333 a + (*) oP 23? +... (1) 


Lemma 1. If rz is a multiple of m* for some nonnegative integer t, and if 


(m,3) =1, then (for s =1,2,..., [ep ee is a multiple of m'+}, 


Let p be a prime factor of m, say with multiplicity a > 1, and first suppose 
p = 5. In the expression 


r P r(r — 1) ++ (r= 2s) | : 
=> m 
yr el Posen 


count the number of occurrences of p as a factor. In the numerator, the factors 
rzym* provide a(t + s) occurrences of p. In the denominator, the number is 


2s+1 2s+1 _ = 
PH, 2) care nngrt tet te) 


which is less than +(2s + 1) < s. So ( 
GB=a(t+s)—(s—1)>a(t+1). 

In the remaining case p = 2, the same argument shows that the number of 
occurrences of 2 as factor in the denominator is at most 2s. In the numerator, 
besides a(t+s) occurrences from rz;m*, s of the factors r—1,r—2,...,r—2s 
are even and (assuming for the moment that s > 2) at least one of them is 
a multiple of 4. So (24) (where s > 2) is a multiple of 27 where 
y=a(t+s)+(s+1)—2s = 6 > a(t+1) as before. 

But if s = 1, (osu eumne® = ree) 2m, a multiple of 2° where 
(because (r — 1)(r — 2) is even) 6 > a(t+ 1); and Lemma 1! follows. 


2s+1 
p-1 


Pa ce is a multiple of p® where, 
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Remark 1. The lemma would fail if p = 3 were allowed; for instance (with 
r=6, 21 =5,m=83, 8 1) 6-5 is a multiple of 3 but (§) -5+3 is not a 
multiple of 9. 


Lemma 2. Jf x > 2 and z > 1 are integers such that 2? = p27 +1, and 
if z is a power of m and (m,3) =1 andm | p, then (given 4) x and z are 
uniquely determined. 


We know x = 2, and z = z, for some r > 1. First suppose x; > 1; we 
show that r = 1. Observe that r is necessarily odd, for otherwise (1) shows 
that x1 | z,; but (#1, m) = 1, contradicting that z is a power of m. So we can 
assume, for a contradiction, that r > 3. There are now at least 2 terms on 
the right of (1). Let m* be the highest power of m dividing the first of these, 
namely (")a{~'z1. By Lemma 1 the other terms are multiples of m't!. Thus 
zy is greater than m' but not divisible by m‘t?, again contradicting that z is 
a power of m. 

Finally, if 7; = 1, we must have yu = 2 and the trivial “solution” (21, 21) 
has been excluded here because « > 2. The same argument as before shows 
that r < 3, so necessarily r = 2. 

To deal with the case in which 3 | m, we first show that Lemma | continues 
to hold provided 9 | m and s > 1. We state this as 


Lemma 38. If rz is a multiple of m* for some nonnegative integer t, and if 
9|m, then (for s = 2,3,..., [)), (oy 1) 1m is a multiple of m'+?. 


In view of Lemma 1 it suffices to check the occurrences of the prime 3 
nO) ms where s > 2. In the numerator there are a(t + s) from 
rzym*, as before, together with at least one from (r — 1)...(r — 2s). In 
the denominator, as before, there are fewer than (2s +1)/2, hence at most s. 
Altogether (og eure is a multiple of 3° where 6 = a(t+s)+1—s > a(t+1), 
as before, and Lemma 3 follows. 


A modified form of Lemma 2 now holds: 


in 


Lemma 4. If x > 2 and z > 2 are integers such that x? = uz? +1, and if z 
is a power of m, and 3|m and 3m | pu, then (given uw) x and z are uniquely 
determined. 


As in Lemma 2 we have that x = x, and z = z; for some positive integer r. 
We show that r must be just one of 1 and 3. 

Define ¢ as in Lemma 2, and first suppose r > 5. There are then at least 
3 terms on the right of (1). By Lemma 3, all but the first 2 of these are 
multiples of m'+!. The sum of the first two can be written 


: 1 —1 
re, °zyju where u= suet((" 9 ) + 3) + 


Now we observe that (u,m) = 1. In fact, if p is a prime factor of m other 
than 3 we clearly have p | (u+ 1), so p is not a factor of u. And if p = 3 this 


T 
eR 
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still holds because 9 | yu. Since (a1,m) = 1 also, we see that the sum of the 
first two terms in (1) is a multiple of m* but not of m‘*+, contradicting that 
z is a power of m. 

As in Lemma 2, r cannot be even here, so r can only be 1 or 3. We show 
these alternatives are mutually exclusive. 

Suppose not. Then both z3 = 3721 + zu, and 21, are powers of m; so 
their quotient, 3x? + z?j, is also a power of m, and in particular a multiple 
of m. Hence m | 327; but (m,z1) = 1, so m | 3, forcing m = 3. We also 
have x? = juz? +1 (the alternative —1 being excluded here because it is not a 
quadratic residue mod 3), so 327+ 22 = 4.27 +3 is a power of m = 3, and 
so also is (427 +3)/3 = 12(u/9)z? + 1. This is impossible, as it is greater 
than 1 and congruent to 1 mod 3. 

The theorem now follows readily from Lemma 2 when m is not a multiple 
of 3, and from Lemma 4 when 3 | m by the following modification of the 
initial transformation to Pell equations. Put 


u=Am® ifyisodd, Am? if y is even 
and 


z= m-9)/2 or mY-?)/? respectively. 


Then the equations (*, ++) reduce to x? = wm?+1 (the — sign being impossible 
now because 3 | 4). In each case, Lemmas 2 and 4 show that, if (nontrivial) 
solutions for « and z exist, they are unique, given yp, As there are two 
possible values for ju in each case, there are at most two (nontrivial) solutions 
altogether for (*, +). 


Remarks 


(i) For certain values of X and m there actually are two nontrivial solutions 
to (+, +); for instance, 5? = 3-29 +1, 7? = 3-24+41; and 117 = 15-23+1, 
31? = 15-2®+41. There can also be additional trivial “solutions”, for 
instance 2? = 3-2°+1 and 44 =15-2°+1. 

(ii) From the way in which the two possible values of y arise, if there are 
two solutions to (*, +), one will have an odd value of y and the other an 
even one. 

(iii) If A is a square, the equations (*,+) can have at most one solution 
between them, for y must be odd and there is only one value of pw to 
consider. 

(iv) The equation (*,—) can have solutions only if —1 is a quadratic residue 
mod Am. But I do not know whether («,—) can then have more than 
one (nontrivial) solution, nor whether the two equations (*, +) can have 
one solution each. 

(v) The equations z? = 10¥ +1 have no solutions other than the trivial 
3? = 10! — 1. For, as in (iii), we have only one value of jz to consider; 
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namely yp = 10; and then 2; = 3, 21 = 1. (Thus the numbers 
100...001, 11...111 in the usual scale of ten, and of more than one 
digit, are never squares.) Of course, much more than this is known; 
see [9]. 


(vi) Background A problem in the American Mathematical Monthly (E 2511, 


January 1975, p. 73), due to M. Olitsky, reduces to solving the 
Diophantine equations x? + 1 = 5¥, 27 + 1 =2-5¥. The solution (ibid., 
April 1976, p. 291) mentioned references [7-9] below. And the equations 
suggested the present generalization to 7? = \m¥ +1. 
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It is an honor to be asked to contribute a paper to so historically important a 
collection. Yet it can be embarrassing too. In my case I ask distractedly 
“What can I write about? The researches I have completed have been 
published already, or at least have been submitted to Journals. The work I 
am engaged upon is incomplete, may be anticipated, perhaps even fallacious. 
And what else can there be?” 

But perhaps I can say something about what I am doing now, mathe- 
matically, though my discourse may be more speculative than demonstrative. 
Something about how it came to interest me and what I hope to get from it. 

It seems that someone once conjectured that every cubic graph without 
a Tait coloring contains a subdivided Petersen graph (or is a Petersen graph 
itself). Some time last summer (1994), this conjecture having come up several 
times in conversation, I reacted against it. “So what?” I asked, “Probably 
every sufficiently nonplanar cubic graph has that property!” 

On later reflection I realized that in this hasty dismissal I had another 
conjecture, and even one that a graph-theorist might reasonably hope 
to prove. 

Over the next few months that conjecture, having persisted in my 
thoughts, changed its form somewhat and acquired some qualifications. 
I recalled that the Petersen graph was famous as the smallest cubic graph of 
girth 5. I also knew of cubic graphs of girth less than 5 that obviously did 
not contain a Petersen graph. So I began to think of the new conjecture as 
being about cubic graphs of girth 5 or more. 

After these months of incubation the conjecture reformulated itself as the 
problem of classifying the “critical” cubic graphs. A critical cubic graph was 
to be defined as one of girth >5 that contained (in subdivision) no other cubic 
graph of girth >5. Obviously any finite cubic graph of so high a girth would 
either be critical itself or would contain a subdivided critical cubic graph. 
One hoped there would not be too many critical cubic graphs. Perhaps only 
the Petersen graph and the graph of the regular dodecahedron? 

There was an elementary theorem about critical cubic graphs that seemed 
relevant to the problem of classification. Suppose we have a cubic graph G of 
girth at least 5 and delete an edge A, suppressing its now-divalent ends to get 
a small cubic graph H. Then H also has girth at least 5 unless A “impinges” 
on a pentagon P of G. “Impinges” means that A is not an edge of P but does 
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have one end in P. Since G contains a subdivision of H we infer that if G is 
critical each of its edges must impinge upon a pentagon of G. 

From this theorem there developed a theory of “semicubic” subgraphs 
of a critical cubic graph G. (A graph is semicubic if each of its vertices has 
valency either 2 or 3). Let us define a “free” edge of a semicubic graph J of 
girth 5 as one that impinges on no pentagon of J. Let us say that such a graph 
J is “open” if it has a free edge and “closed” otherwise. Given such a J with 
a free edge A it was found possible to make a list of all the semicubic graphs 
of girth 5 that could be obtained from J by uniting it with a pentagon that 
was impinged upon by A. Then one had a theorem saying that if a critical 
graph G contains J then it must contain also one of those more complicated 
derivatives of J. Each of these derivatives, if open, had to be treated in the 
same way, and so on. The end result was a list of closed semicubic graphs 
such that each critical graph G must contain as a subgraph some member of 
the list. 

Getting this theorem was a long and tedious process. Getting from it to 
the desired classification was a long and arduous task. “O Murphy” I groaned, 
“Wherefore doth thy Law throw me always the hard ones?” 

In the course of the investigation I encountered three more critical cubic 
graphs that I thought interesting. One consisted of a nonagon and three 
outside vertices each joined to the nonagon by three edges. Another could 
be derived from a cube by the following construction. Two opposite sides of 
one face are bisected and the points of bisection are joined by a new segment. 
The opposite face of the cube is treated likewise, but so that the two new 
segments are perpendicular. Then the two new segments are bisected and 
their mid-points are joined by a new edge. 

Another critical cubic graph can be constructed from a rectangle with two 
long sides and two short ones. Inside it, as a frame we draw two pentagons 
having one edge in common. Their vertices opposite the common edge are 
joined to the mid-points of the short sides of the rectangle. Their other 
vertices are joined to the points of trisection of the long sides. All these 
joins are such as to preserve planarity. Finally each pair of opposite corners 
of the rectangle are joined by a new edge, to give a critical cubic graph of 
crossing number 1. 

After adding these three graphs to the Petersen graph and the dodecahe- 
dral graph I now had a list of five “interesting” cubic graphs. I called them 
the Z-graphs, and noted that they were all cyclically 5-connected. 

There was a gaggle of other critical cubic graphs that I thought less 
interesting. Each of these had an isthmus or a 2-bond, or a 3-bond that 
separated circuits. Perhaps one could, in a way, eliminate these by showing 
that a cubic graph K of girth at least 5 and of sufficiently high cyclic 
connectivity must contain a subdivision of a Z-graph? I an now satisfied that 
there is such a theorem. It requires only that K be cyclically 4-connected. 
More hard work, and with many false starts! 
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Work on the original conjecture cannot be regarded as complete until we 
have an extensive and simply defined class of cubic graphs that must contain 
(in subdivision) a Petersen graph and not merely “some Z-graph” ). Perhaps 
there is a hint of such a theorem in the fact that the Petersen graph is, in 
a way, more nonplanar than the other Z-graphs. The dodecahedral graph is 
planar. The one derived from a rectangle has crossing number 1. The one from 
a cube can be made planar by deleting one edge, and the nonagonal one by 
identifying two vertices. But the Petersen graph has crossing number 2 and it 
cannot be made planar by either of those two tricks. If sufficiently high cyclic 
connectivity in K ensures the appearance (in subdivision) of a cyclically 
5-connected Z-graph, then perhaps sufficiently complex nonplanarity will 
ensure the appearance of a Petersen graph, that most nonplanar of the Z- 
graphs? 

Meanwhile I am happy to present some of the facts in a book dedicated 
to the Master. 


Il. Number Theory 


Introduction 


It is difficult to estimate the relative impact of Erdés’ research in different 
areas of mathematics. But it is a fact that Erdés started with number theory 
(e.g., out of his first 60 papers only 2 are not related to number theory) 
and that among his publications, the number theory papers have highest 
frequency. His achievements are well known and are amply mirrored by 
contributions to this chapter (which is the largest of all the chapters of these 
volumes). 

The papers by Ahlswede and Khachatrian, Konyagin and Pomerance, 
Nathanson, Nicolas, Schinzel, Shorey and Tijdeman, Sarkozy and Sds, and 
Tenenbaum survey and relate to various parts of Erdés’ research, and they 
complement in various respects his own recollections given in an earlier 
chapter. 

Some of these papers are research articles, such as the papers by Ahlswede 
and Cai, Sarkozy, Tenenbaum and Bergelson et al. (which includes Erdés 
himself as a coauthor). 

Although we believe this is a representative sample of Erdés’ activities 
in this area, many problems and particular research directions are not 
covered. The reader should bear in mind that Erdés himself considered the 
probabilistic methods in number theory together with his work on prime 
numbers as his main contributions to number theory. Probabilistic methods 
are covered by the next section as well. 

In 1995/1996, when the content of these volumes was already crystallizing, 
we asked Paul Erddés to isolate a few problems, both recent and old, for each 
of the eight main parts of this book. To this part on infinity he contributed 
the following collection of problems and comments. 
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Erdos in his own words 

Here is a purely computational problem (this problem cannot be attacked 
by other means at present). Call a prime p good if every even number 2r < 
p—3 can be written in the form q, — gg where q; < p, gg < p are primes. 
Are there infinitely many good primes? The first bad prime is 97 I think. 
Selfridge and Blecksmith have tables of the good primes up to 10?” at least, 
and they are surprisingly numerous. 

I proved long ago that every m < n! is the distinct sum of n — 1 or 
fewer divisors of n!. Let h(m) be the smallest integer, if it exists, for which 
every integer less than m is the distinct sum of h(m) or fewer divisors of m. 
Srinivasan called the numbers for which h(m) exists practical. It is well known 
and easy to see that almost all numbers m are not practical. I conjectured 
that there is a constant c > 1 for which for infinitely many m we have h(m) < 
(log log m)°. M. Vose proved that h(n!) < en‘/?. Perhaps h(n!) < c(log n)@. 
I would be very glad to see a proof of h(n!) < n°. 

A practical number n is called a champion if for every m > n, we have 
h(m) > h(n). For instance, 6 and 24 are champions, as h(6) = 2, the next 
practical number is 24, h(24) = 3, and for every m > 24, we have h(m) > 3. 
It would be of some interest to prove some results about champions. A table 
of the champions < 10° would be of some interest. I conjecture that n! is not 
a champion for n > no. 

The study of champions of various kinds was started by Ramanujan 
(Highly composite numbers, Collected Papers of Ramanujan). See further 
my paper with Alaoglu on highly composite and similar numbers and many 
papers of J. L. Nicolas and my joint papers with Nicolas. 

The following related problem is perhaps of some mild interest, in 
particular, for those who are interested in numerical computations. Denote by 
gr(n) the smallest integer which is not the distinct sum of r or fewer divisors of 
n. A number n is an r-champion if for every t < n we have g,(n) > g,(t). For 
r = 1 the least common multiple M,, of the integers < m is a champion for 
any m, and these are all the 1-champions. Perhaps the M,, are r-champions 
too, but there are other r-champions; e.g., 18 is a 2-champion. 

Let fx(n) be the largest integer for which you can give f;,(n) integers 
a; <n, for which you can not find & + 1 of those that are relatively prime. 
I conjectured that you get f,(n) by taking the multiple < n of the first & 
primes. This was proved for small k by Ahlswede, and Khachatrian disproved 
it for k > 212. Perhaps if n > (1+€)p?, where p, is kth prime, the conjecture 
remains true. 

Let ny < ng <... bean arbitrary sequence of integers. Besicovitch proved 
more than 60 years ago that the set of the multiples of the n; does not have to 
have a density. In those prehistoric days this was a great surprise. Davenport 
and I proved that the set of multiples of the {n;} have a logarithmic density 
and the logarithmic density equals the lower density of the set of multiples 
of the {n;}. Now the following question is perhaps of interest: Exclude one 
or several residues mod n; (where only the integers > n; are excluded). 
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Is it true that the logarithmic density of the integers which are not excluded 
always exists? This question seems difficult even if we only exclude one residue 
mod n, for every n;. 

For a more detailed explanation of these problems see the excellent 
books by Halberstam and Roth, Sequences, Springer-Verlag, and by Hall and 
Tenenbaum, Divisors, Cambridge University Press. 

Tenenbaum and I recently asked the following question: let ny < ng <... 
be an infinite sequence of positive integers. Is it then true that there always 
is a positive integer k for which almost all integers have a divisor of the form 
n; + k? In other words, the set of multiples of the nj +k (1 <i < co) has 
density 1. Very recently Ruzsa found a very ingenious counterexample. 

Tenenbaum thought that perhaps for every ¢ > 0 there is a k for which 
the density of the multiples of the n; +k has density > 1 —e. 

In a paper (Proc. London Math. Soc. (1970) dedicated to the memory of 
Littlewood) Sark6ézy and I state the following problem: Let 1 < a; < ag < 
wes <Gnq2 < 3n be n+2 integers. Prove that there always are three of them 
a, < aj < ax for which a; +a, =0 (mod aj). The integers 2n < t < 3n show 
that n+ 1 integers do not suffice. 

Perhaps a proof or disproof will be easy. As far as I know, the problem 
has been rather forgotten. 

Many more problems are contained in the book P. Erdés and R. L. 
Graham, Old and New Problems and Results in Combinatorial Number 
Theory, the second edition of which should appear soon. 
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So much for Paul Erdés. The progress on his problems since 1995/1996 
has been considerable and many of his results and problems became the 
subject of intensive study. For example research on sum sets (reported in 
this book in the Nathanson article) led to the celebrated Freiman—Ruzsa 
theorem, which in turn has been instrumental in the Green—Tao proof of 
arithmetic progressions in primes (see the Ramsey theory chapter in Vol II). 
For sum sets, see the comprehensive book: 

I. Z. Ruzsa, Sumsets and Structure. In: Combinatorial Number Theory 
and Additive Group Theory (A. Geroldiner, I. Z. Ruzsa, eds.) Advanced 
Courses in Mathematics CRM, Birkhauser, 2009, 87-210. 

In another, yet broadly related, direction, the celebrated sum—product 
theorem of Erdés and Szemerédi became the basis for many similar results 
dealing with the sum—product phenomenon in both the finite and infinite 
setting. This in turn has found many applications. 

P. Erd6és, E. Szemerédi, On sums and products of integers, Studies in 
Pure Mathematics, Birkhauser, Basel, 1983, 213-218. 

T. Tao, The sum-product phenomenon in arbitrary rings, Contributions 
to Discrete Math. 4,2 (2009), 59-82. 
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J. Solymosi, Incidence and Spectra of Graphs. In: Combinatorial Number 
Theory and Additive Group Theory (A. Geroldiner, I. Z. Ruzsa, eds.) 
Advanced Courses in Mathematics CRM, Birkhauser, 2009, 299-314. 

It is only fitting to remark that these questions are close to those 
considered by L. Guth article in Part IV devoted to geometry. The progress 
in combinatorial number theory in the last decade was spectacular. Perhaps 
P. Erdos didn’t expect that. Here is an evidence of this: He mentions in the 
introduction of his article in this volume (Some of my favourite problems and 
results) an anecdote about Hilbert’s wrong estimation on what is a difficult 
problem. Well the same happened to him: In the above article he states that 

“An old conjecture in number theory states that for every k there are k 
primes in an arithmetic progression. This problem seems unattackable” . 

Well as we know the opposite is exactly what happened. But the Erdés 
related conjecture is still open and (quoting him again) “neither Szemerédi 
nor Furstenberg” (nor Gowers nor Green and Tao) “are able to settle this.” 

For the general development of combinatorial number theory, see recently 
published book: 

T. Tao, V. Vu, Additive Combinatorics, Cambridge University Press, 
2006. 

Note also that since 2000 there has been published the electronic journal 
of combinatorial number theory, Integers, which has P. Erdés as part of its 
logo (www.integers-ejcnt.org). 
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Summary Let Z, be the lattice of intervals in the Boolean lattice £,. For 
A,BCT, the pair of clouds (A, B) is cross-disjoint, if 1N J=@ for l¢ A, JE B. 
We prove that for such pairs |A||B] < 3?”~? and that this bound is best possible. 

Optimal pairs are up to obvious isomorphisms unique. The proof is based on a 
new bound on cross intersecting families in £, with a weight distribution. It implies 
also an Intersection Theorem for multisets of Erdés P, Schénheim J (1969) On the 
set of non pairwise coprime division of a number. In: Proc. of the Colloquium on 
Comb. Math. Dalaton Ftired, pp 369-376. 


1. The Results 


Consider the set [n] = {1,2,...,n}, the set of all its subsets £,, and the 
lattice of intervals Z, = {I = [A,B]: A,B € Ly}, where [A,B] ={CeEL,: 
ACCC B}, if AC B, and [A, B] = Ig (the empty interval), if A ¢ B. The 
lattice operations A and V are defined by 


[A, B] A [A’, B’] = [A,B] [A’, BY, (1) 
[A, B] v [4’, B’] = [An A’, BUB’). (2) 


Here the empty interval Ig, is represented by [[n], 0]. The pair (A, 8B) with 
A,B CT, \ {Ip} is cross-disjoint, if 


IA J=TIg forl eA, JeEB. (3) 
Let us denote the set of those pairs by Dy. 
Theorem 1. Forn=1,2,... 
max{|A||B] : (A,B) € Dy} = 3?"-2. 
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Equality is assumed for 
A*={leT,:I=[A,Bl1¢ Bs, Be ={leT,:I=[A,B],1€ A}. 


All optimal pairs are obtained by replacing 1 in the definition of A* and 
B* by any element m of [n|, and by exchanging the roles of these two sets. 


We shall relate cross-disjoint pairs of clouds from TZ, to cross-intersecting 
pairs of clouds from £,, with a suitable weight. 
Recall from [1] that (U,V) with U,V C L,, is cross-intersecting, if 


UNV £0 forU €U and V EV. (4) 


We denote the set of these pairs by P,,. Furthermore we introduce the 
weight w: Lp, —- N by 


w(A) = 2”-lA4l for A € Ly. (5) 
Theorem 2. For (U,V) € Pn 
WWU)W(V) = S° wY)- SO w(V) < 32-9) 
Ucu vev 


and the bound is best possible. Moreover, for any optimal pair (U,V) there 
exists at € [n] such thatU =V={AEL,:te€ A}. 


2. Another Description for T,,\ {Io} 


We associate [A,B] € TZ, \ {Ip} with a ternary sequence W([A,B]) = 
(a1,22,...,2n), where 


0 ift¢B 
r= 1 ifte A 
2 ifteB\A. 


Ww: TZ, \ {Ip}  {0,1, 2}” is bijective. 
If W([A, B]) = 2” and U([{A’, B’]) = y”, then 


[A, B] A [A’, B’] = Ig & St € [n] : {21, yx} = {0, 1}. (6) 


For (A,B) € D, the associated pair (¥,Y) = (U(A), V(B)) has the 
property: 


For a” €X,y" EV {ax1, yz} = {0,1} for some t € [n]. (7) 


We can view (4, V) as families of cross-disjoint subcubes of the n-dimensional 
unit cube or as families of cross-disjoint cylinder sets in {0,1}” in the sense 
of measure or probability theory. In this interpretation 2 stands for the set 


{0,1}. 
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Henceforth we consider pairs (Vv, Y); ¥,Y c {0,1,2}"; satisfying (7). We 
call them cross-disjoint. The set of these pairs is denoted by D*. Our first 
goal in proving Theorem 1 is to show that for (4, ) € D* 


Zipisor™. (8) 


3. Down-Up-Shifts 


The proof of Theorem 1 goes in several steps. At first we show here that 
any (4,) € D* can be transformed into another pair in D* with the same 
cardinalities and with invariance under down-up-shifts. They are defined as 
follows. 

For any Z C {0,1, 2, }” and any t € [n] set 


d,(Z) A Piste Pianist Pp Ry BPS Oca SS 1 and 
l{z: (21,- +5 2t-1) 2 2t415---,2n) © Z}| =). (9) 


This is the down-shift of Z in the t-th component. Similarly, u;(Z), the 
up-shift of Z in the t-th component is obtained by exchanging 0 and 1 in 
the t-th component of the sequences in d;(Z). We formulate an immediate 
consequence of our definitions. 


Lemma 1. For any (¥,Y) € Dx andt € [n], then also (di(4), us(V)) € Dy. 
We say that (V,Y) with ¥,Y C {0,1, 2}” is down-up-extremal, if 
(di(¥), ut(V)) = (4, ¥) for all t € [n]. (10) 


4. Relation to Cross-Intersection in Ly 


Next we introduce the mappings a; : {0,1,2}" > L, by 


oi(z") = {t: 2% =i1,1<t<n} fori =0,1. (11) 
We also put for Z c {0,1,2}” 
oi(Z) = {ai(z"): 2" € Z}. (12) 


These mappings make it possible to convert cross-disjoint pairs of clouds 
from the interval lattice Z, into cross-intersecting pairs of clouds from the 
Boolean lattice Ly. 

More precisely we have the following result. 


Lemma 2. Suppose that (X,Y) with X,Y C {0,1,2}”" is down-up-extremal. 
Then (X,Y) is cross-disjoint exactly if (oo(¥),01(V)) is cross-intersecting, 
that is, XY #0 for X € o9(X) and Y € 04(Y). 
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Proof. Suppose that (4, ) is cross-disjoint, but that (o0(X), 01 (Y)) is not 
cross-intersecting. Then there exist «” € V,y" © Y, and a non-empty set 
E Cc [n] such that (a,y:) = (1,0) for t € E and {az,y:} A {1,0} for 
t ¢ E. However, since (A’, V) is down-up-extremal, the sequence «/” obtained 
from x” by replacing for t € E with a = 1 by 2; = 0 must be in & 
and this sequence is not disjoint with y”. This contradiction proves that 
(o0(¥),01(Y)) is cross-intersecting. The reverse implication is obvious. 


5. Theorem 1 from Theorem 2 


Notice that for A C [n] 


Joa *(A)| = Joy *(A)| = 24. (13) 
Therefore also for any A, B C [n] and ¥,Y c {0,1,2}" 
lag (AN A] < 2°41 lop (B)NY| < 27-71. (14) 


Now in upper bounding |4’||Y| for (4, Y) € D* we can assume by Lemma 1 
that (4’, VY) is down-up-extremal and by Lemma 2 that (a0(4),01(V)) € Pn. 
Hence Theorem 2 implies that for U/ = 09(4) and V = 01(Y) 


WU)W(V) < 32-Y 
and thus by (14) and (13) 


JXV) = SO loo (UW) NA]. YE lor (V) NI 


UcuU Vev 
ay 2. yee awuwe<s™. 6) 
UcuU Vev 


The characterization of the optimal pairs follows from the one in 
Theorem 2. We use right away the sequence terminology. If (v,Y) € D* is 
optimal, then applications of operations (d;, u;) and oo, 01 lead to an optimal 
(U,V) € Pn by (15). 

By the uniqueness part of Theorem 2 for some ¢t € [n], without loss of 
generality say t = n, we have U = V = {Ae £,: t © A}. Furthermore 
(05 (WU), 0; 1(V)) = ({0,1, 2)" x {0}, {0, 1,2)" x {1}). 

It remains to be seen that (d;+,u;,') leads to no non-isomorphic pairs. 
We have 


a, 0,1, 2}"-* x {0}) =.(0) x {0} UO) & {17 0 a) x {2}, 
un ({0,1,2}"* x {1}) = VO) x {0} U Y(1) x {1} U V2) x {2}, 


where by the optimality the ¥(i)’s and also the Y(z)’s partition {0,1,2}"—1. 
Therefore for some i and some j, (2,2,...,2) € ¥(#)N Vj). But now by (7) 
necessarily {i,j} = {0,1} and ¥(i’) = @ fori 47’, V(r’) = @ for 7 4 7’. We 
have arrived at the desired form. 
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6. Auxiliary Results for Proving Theorem 2 


Obviously in deriving an upper bound on W(U)W(YV) for (U,V) € Pn we can 
always assume that U and Y are upsets. 

Moreover we can replace (U,V) by the pair of images (5;;(U), Si; (V)) 
under the familiar left-shifting S;;: 

For any e C £y, andi < 7, 


5,,(z) = { EMi i} iid B,je Band EACi,j} ¢e, 
‘J ~~ |B otherwise. 


for EF € € and 


(16) 


Sij(E) = {5i;(E) BE ce}. 


Just verify that (U,V) € Pn implies (5j;(U), Si;(V)) © Pn, that |.$;;(Y)| = 
24], |Si;(V)| = |V], and that 


W(Si3(U)) = WW), W(Sij(V)) = W(Y). (17) 


Clearly, finitely many applications of left-shifting operators results in a 
pair, which is invariant under further such operations. We call such a pair 
left-shifted. 

Let now (U,V) € P,, be a pair of left-shifted upsets. 

For the analysis of such pairs we introduce the following sets and families 
of sets. 

For A C [n] its projection on [n — 1] is 


pA=AN[n- 1] (18) 
and for A C £L,, we define 
pA={pA: AEA}. (19) 
Furthermore we partition A into 
Ap = {AE A: n¢€ A}, A, = {AE A: ne A}. (20) 


Thus also Up, U1, Vo, Vi, pli and pV; (¢ = 0,1) are well-defined. 
Since U and VY are upsets 


plo Cc py and pYo G pV1- (21) 


Lemma 3. If (U,V) € Pn cannot be enlarged without violating the cross- 
intersection property and U,V are left-shifted upsets, then for all U € Uy 
with pU Ep Uy \ plo there exists aV € V1 with pV € pVi \ pVo such that 


(i) pPUNgV =9,pU UpV = [n—- 1] and 
(it) For all V' with pV’ € pVi \p Vo and V’ 4 V necessarily »~UN pV’ #0. 


Exchanging the roles of U and V gives analogous statements. Furthermore 


(itt) plo = ph => pYo = pV1- 
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Proof. (i) For every U € Uy, with ,U € pl; \ plo there must exist a V € Vy 
with pV €, Vi\pVo with p,>UN,V = 0, because otherwise U is intersecting 
on [n — 1] with all V* € Vy, and by (21) with all V* € V, and thus one 
can enlarge U by U \ {n} in contradiction to our assumptions. 

Furthermore for this V,U U,V = [n — 1], because otherwise for some 

i € [n—-1]\ pUU,VU* = Sin(U) € U by assumption and U* NV = @ in 

contradiction to the fact that (U,V) € Pn. 


(ii) The forgoing argument shows that ,V’ ¢ »V. and that necessarily ,»U NM 
pV’ £90, because ,U U,V = [n — 1]. 

(iii) This follows from (i), (ii) and the analogous statements obtained by 
exchanging the roles of U and V. 


7. Proof of Theorem 2 


We proceed by induction on n. The case n = 1 is verified by inspection. For 
n > 2 we can consider a (U,V) satisfying the assumptions of Lemma 3. 
Case: ,Uo = pl. 

By (iii) in Lemma 3 we have also ,Vp = pV1. However (plo, »Vo) € Pn—1 
and by the induction hypothesis 


W (pllo)W (pVo) < 32"-?). (22) 
Now just calculate that in the present case 
W(U)W(YV) = [W(pUo) - 2 + W(pli)] - [W(pVo) - 2 + W(pV1)] 
= 3W(plUo) -3W(pVo) < 32-9), 


Case: Uo Ap U1. 

By Lemma 3 ,Vp # pV and there are subsets U € U4;, V € V, satisfying 
(i). For all V’ € Vo necessarily p»U MN, V’ 4 @ and again by Lemma 3 also for 
all V’ EV \{V}, Vi AV, pUN,pV' #9. This means that (UU {U \ {n}}, V\ 
{V}) © P, and symmetrically (U \ {U}, VU{V \ {n}}) € Ph. 

Moreover we see that 


WU {U \ {n}}) =WUY) +2717, (23) 
W(V\ {V}) = Wy) -2" l= wy) 2! (by (i), (24) 
WU \ {U}) =Wl) - 27-71, (25) 

and 


W(VU{V — {n}}) = W(V) + 2771141 = wy) +2l¥! (by (i). (26) 
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By the optimality of (U,V) we conclude with (23) and (24) that 
WU)W(V) >WUU{U \ {n}})W(V \ {V}) 
= (WU) +2) (wy) — 20") 
=WU)Ww(y) -2!4!-1w) + 2r-l8 wy) — 2” (27) 
and with (25) and (26) that 
WWU)W(V) 2WU\{U})W(V UV \ {n}}) 
= (WY) — 2"-l)(wy) + 2!"!) 
=W(U)W(y) +2! WU) — 2r-lUlw(y) — 2". (28) 
Now (27) and (28) yield 
— 2lUl-tyw) + 2ar-l8 wy) < 2” (29) 


and 
lw) —2"-llw(y) < 2”. (30) 


The double of the left hand side in (29) plus the left hand side of (30) equals 
3-2”-IYIW(V) and satisfies 


S07 (Pp) = 330". 
This is equivalent to 
W(v) < 241, (31) 
Similarly, by doubling the left hand side of (30) and adding to it the left 
hand side of (29) leads to the inequality 
Wey ae, (32) 
The two inequalities imply 
W(U)W(Y) < 2"*1 < 3°") for n > 2. (33) 
We calculate that for UU =V= {AC [n]:1€ A} 
W(U)W(Y) = 32-9), 
Finally we prove uniqueness. We have learnt already that for optimal left- 
shifted pairs (U,V) necessarily p»Uo = pl4i,pVo = pVi and that by the 
induction hypothesis W(,l4)W()Vo) = 32"-?). Thus U = V = {A C [n] : 
1 € A}. In general, every optimal pair (U*,V*) can be left-shifted to 
(U,V). Since the left-shifting operators don’t change cardinalities of subsets, 


there must be a singleton {t} in both, U* and V*. Consequently we have 
U*=V* ={AC [n]:t € A}. 
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8. A Common Generalization of Theorem 2 
and a Theorem of Erddés-Schénheim [9] 


In deriving their Intersection Theorem for multisets Erdés and Schénheim 
established first an Intersection Theorem with weights for £,. Those weights 
w(A),A € Ly, are increasing in |A|, whereas our weights w(A) = 2”7!4I 
used in Theorem 2 are decreasing in |A|. The latter does not allow to just 
choose the “heavier” one of A and A° = [n] \ A in order to construct an 
optimal configuration. This difference makes things more difficult in our case. 
Nevertheless we can give a unified approach. 

Let W = {W; : 1 <i <n} be positive reals which give rise to the weight 
won Ly: 


w(A) = II W;, for A Cc [nj (34) 
teA 
and 
W(A) = S© w(A) for AC Ln. (35) 
AEA 
Define 
a(n,w) = max{W(A): AC Ly, is intersecting} (36) 


(ie. AN BAO for A, BE A). 
We recall first a result of [9]. 


Theorem 3. 


a(nstw) <5 max (w(A), w(A°)) (37) 


and the bound is best possible when W; > 1 fori € [n]. 


Proof. Clearly an intersecting A can have at most one of the sets A and A‘ 
as member. 


One can construct an optimal intersecting family A(n,w) in the case 
w; <1 for i € [n] (38) 
as follows: 


(a) If w(A) > w(A®), then A € A(n, w). 

(b) If w(A) = w(A‘) and |A| > |A‘|, then A € A(n, w). 

(c) If w(A) = w(A*) and |A| = |A‘|, then take anyone of A, A° into 
A(n,w) and keep the other out of A(n,w). Clearly, W(A(n,w)) = 
15> max{w(A), w(A°)}. 

AcC[n] 
By (38) and (a)-(c) A(n,w) is an upset and also intersecting, because 
for A,B € A(n,w)AN B = @ implies A° D B and thus AS € A(n,w) in 
contradiction to (a)—(c). 
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However, without condition (38) the A(n,w) described above need not 
be an upset or intersecting. 

For example when w; < 1 for alli € [nl], then the biggest weight is assigned 
to the empty set, which cannot occur in an intersecting family. Therefore (37) 
is not tight. 

Fortunately an analysis of the proof of our Theorem 2 leads us to the 
right generalization. 

First of all by relabelling we can always assume that 


W1 > We S++ > Wn. (39) 
Let now m be the largest index with w,, > 1, if it exists, and otherwise 
set m = 0. Set W! = {w; : i € [m]}, w’(B) = J] w; for BC [ml]. 
Next define = 
{Ac [n]: AN [m] € A(m, w’)} ifm>1, 


Aaa oe ifm =0. (40) 
Clearly, 
A*(n,w) = A(n:w), if (38) holds. (41) 
Theorem 4. 
a(n, w) = W(A*(n,w)). (42) 
Proof. We use induction on n — ‘m. 
The case n — m = 0 or n = m is the case covered by Theorem 3. 
Case: n—m > 0 
Suppose that A is an optimal intersecting family, that is, 
W(A) = a(n, w). (43) 


Since (39) holds, the left-pushing operator $;; can be applied, because 
it does not decrease the total weight. We can therefore assume that A is 
invariant under such operations. Also we can assume that A is an upset, 
because adding an A’ C [n] to A with A’ D A for some A € A does not affect 
the intersection property and could only increase the total weight. 

We use again the projection p on [n — 1] and our earlier definitions 
pA, pA, Ai, and »A;(i = 0,1). Since A is an upset 


pAo C pA. (44) 


Case: »Ao = pAt. 
Since ,Apo is intersecting, by the induction hypothesis in this case 


W(A) = W(pAo) + wnW (pA1) = (1 + wn)W (pA1) 
< (1+ w,)W(A*(n - 1,w")) = W(A*(n,w)), 


where w” = (w;)/2;' and the last identity follows with definition (40). 
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Case: ,A # »A1. 
Here there is an A € A; with A \ {n} ¢ Ap and there must be a B € A, 
with 


Bn A= {n}, (45) 


because otherwise one can enlarge A by A \ {n}. Now the same ideas as used 
in the proof of Lemma 3 apply and give 


pAU,B = ([n-]] (46) 


and consequently that the B with these properties is unique, because 
otherwise there is an i € pAU,B and, since S;,,(A) € A, by (45) 


Sin(A) NB = (pAU {i}) N (Bp U{n}) = 9. 


This is a contradiction. 
Since A and B can be exchanged, we can assume that. 


w(A) > w(B) (47) 
and consequently that 
w(A\ {n}) = 2S w(A) > wi(B), (48) 


because w, <1, ifn—m> 0. 

Finally, since B is the unique member of A satisfying (45) (A\{B})U{A\ 
{n}} is intersecting and by (48) has bigger weight than A. This contradicts 
the optimality of A. The case ,Ap 4 »Ai cannot arise. 


9. Maximal Families of Disjoint Intervals 


One might wonder what can be said about families A C Z, with 
ANB=Ig for A,BE A. (49) 


The family A corresponds to a set A* C {0,1,2}" by the mapping WV of 
Sect. 2. A* has the property: 


for all a”, y” € {0,1,2}” for some t € [n]{xz, yx} = {0, 1}. (50) 


One readily verifies that |A| < 2”, and equality occurs for A* = {0,1}". 
In fact the problem is equivalent to Shannon’s zero error capacity problem 
for the matrix 


NIF OR 
NIF e © 


As Shannon noticed in [11], it equals log, 2 = 1. 


Cross-Disjoint Pairs of Clouds in the Interval Lattice 117 


References 


10. 


11. 


. R. Ahlswede and Z. Zhang, “On cloud-antichains and related configurations” , 
Discrete Mathematics 85, 225-245, 1990. 

. D.E. Daykin, D.J. Kleitman, and D.B. West, “The number of meets between 
two subsets of a lattice”, J. Comb. Theory, Ser. A, 26, 135-156, 1979. 

. R. Ahlswede and L.H. Khachatrian, “Sharp bounds for cloud-antichains of 
length two”, SFB 343, Preprint. 92-012. 

. R. Ahlswede and L.H. Khachatrian, “Towards equality characterization in 
correlation inequalities”, SFB 343, Preprint 93-027, to appear in European 
J. of Combinatorics. 

. R. Ahlswede and L.H. Khachatrian, “Optimal pairs of incomparable clouds in 
multisets”, Graphs and Combinatorics 12, 97-137, 1996. 

. P. Seymour, “On incomparable families of sets”, Mathematica 20, 208-209, 
1973. 

. D.J. Kleitman, “Families of non-disjoint subsets”, J. Comb. Theory 1, 153-155, 
1966. 

. R. Ahlswede and D.E. Daykin, “An inequality for the weights of two families 
of sets, their unions and intersections”, Z. Wahrscheinlichkeitstheorie u. verw. 
Geb. 43, 183-185, 1978. 

. P. Erdés and J. Schonheim, “On the set of non pairwise coprime division of a 

number”, Proc. of the Colloquium on Comb. Math. Dalaton Ftired, 369-376, 

1969. 

P. Erdés, M. Herzog and J. Schénheim, “An extremal problem on the set of 

noncoprime divisors of a number”, Israel J. Math., vol. 8, 408-412, 1970. 

C.E. Shannon, “The zero-error capacity of a noisy channel”, IRE Trans. Inform. 

Theory, vol. IT-2, no. 3, 8-19, 1956. 


Classical Results on Primitive and Recent 
Results on Cross-Primitive Sequences 


Rudolf Ahlswede and Levan H. Khachatrian 


R. Ahlswede (Deceased) - L.H. Khachatrian (Deceased) 
Fakultat ftir Mathematik, Universitat Bielefeld, Postfach 100131, D-33501 
Bielefeld, Germany 


Summary. When the kind invitation of Ron Graham and Jaroslav NeSetiil, to 
write in honour of Paul Erdés about aspects of his work, reached us, our first 
reaction was to follow it with great pleasure. Our second reaction was not as 
clear: Which one among the many subjects in mathematics, to which he has made 
fundamental contributions, should we choose? 

Finally we just followed the most natural idea to write about an area which 
just had started to fascinate us: Density Theory for Integer Sequences. 

More specifically we add here to the classical theory of primitive sequences and 
their sets of multiples results for cross-primitive sequences, a concept, which we 
introduce. We consider both, density properties for finite and infinite sequences. In 
the course of these investigations we naturally come across the main theorems in 
the classical theory and the predominance of results due to Paul Erdés becomes 
apparent. Several times he had exactly proved the theorems we wanted to prove! 
Many of them belong to his earliest contribution to mathematics in his early 
twenties. 

Quite luckily our random approach led us to the perhaps most formidable 
period in Erdés’ work. It reminds us about a statement, which K. Reidemeister 
[18, ch. 8] made about Carl Friedrich Gauss: “... Aber das Epochale ist doch die 
geniale Entdeckung des Jiinglings: die Zahlentheorie.” 


1. Classical Results 


At first we set up our notation. N denotes the set of positive integers and 
P = {p1, po,...} = {2,3,5,...} denotes the set of all primes. For the number 
u,v € N we write u | v, if u divides v. Further (u,v) stands for the largest 
common divisor and (u,v) denotes the smallest common multiple of u and v. 

In case (u,v) = 1, u and v are said to be relatively prime (or coprimes). 
The greatest prime factor of u is written as p*(u). For i < j, [i,j] equals 
{i,i+1,...,7} and (2,7] equals {i + 1,...,7}. Any set A C N can also be 
viewed as an increasing sequence (a;)92, where A = {a; : 7 € N}, and vice 
versa. We reserve the letter A for such sets or sequences. It is convenient to 
use the abbreviations A(x) = AN [1,2] and |B] for the cardinality of any 
set B. We also use $(z,y) = |{n € [1,2] : pt (n) > y} I. 

The lower and upper asymptotic density of A are 


A = A 
dA=tming4@ and A A=timop 42 gy 
Lo av L—->00 x 
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If d A=d A, then A possesses the asymptotic density d A= d A=d A. 
Related quantities are 


rae 1 1 x : 1 1 
6 A= lim inf log e = te and é A= lim sup log x > ie (2) 


the logarithmic lower and upper density of A. If 6 A = 6 A, then A possesses 
logarithmic density 6 A=6 A=6 A. 

In the first half of the century there was noticeable interest in the study 
of density properties of the set of multiples 


M(A)={meEN: forsomeae A a|m} (3) 


of infinite sequences A of positive integers. This naturally relates to the study 
of primitive sequences. 
A sequence A = (a;)%, is primitive, if 
Ay t aj for 1 x j. (4) 
One readily verifies that every A contains a unique subsequence P(A) which 
is primitive and satisfies 


M(P(A)) = M(A). (5) 
Actually, 


P(A) ={aeA:4b€ A,b fa and b| a} (6) 


One question of Chowla (see [2]) opened the subject: Does d M(A) exist for 
every A Cc N? 

This can readily be shown to be the case for all finite A, however, this was 
open for a longer time and finally settled in the negative by Besicovitch [2] 
in the infinite case. 


Theorem 1 (Besicovitch [2]). For every « > 0 there is an ACN with 
~ 1 
d M(A)> 5 and d M(A) <e. 


Actually, the A’s are constructed as unions of suitable intervals. The 
primitive sequence P(A) generating the M(A) of Theorem 1 gives the next 
famous result. 


Theorem 2 (Besicovitch [2]). For every > 0 there is primitive sequence 
A’ with 

= 1 

dA'>5-€ and d A’ <e. 


This shows that a question of Davenport and Erdés (see [7, 13]), whether 
every primitive sequence has asymptotic density 0, has a negative answer. 

We derive next an upper bound on d A because it is instructive and 
beautiful. For any primitive A = {aj,...,@a} C [1,2n] let d; denote the 
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greatest odd divisor of a;. Then necessarily d,,...,dq are all distinct and 
hence 

JA] =a<n. (7) 
Theorem 3 (Behrend [3]). For every primitive A, d A < 5. 
Example 1 (Everybody). {n+1,...,2n} is primitive and has density s. 


This simple fact is very relevant in the analysis of infinite primitive 
sequences. 
Now Paul Erdos enters the scene. 


Theorem 4 (Erddés [5]). For a primitive A DB {1} 


CoO 


1 
2 a; log a; ce 


co 


It is an open problem of Erdés whether }> ee < 9 eae: 
par i log pi 


By Abel summation it can be shown that for any set B c N 


0<d B<SbB<S5B<dBK<1. (8) 
Since in eae < os aera , by Theorem 4 6 A = 0 for 


primitive A. Also by (8) d A = 0. We state this result. 


Theorem 5 (Erdés [5]). For every primitive sequence A, d A= 6A =0 
or (equivalently) 


1 
rae aac asn—- oo. (9) 
ay<n % 


Logarithmic density has turned out to be an appropriate measure! Also, what 
can be said about the speed in (9)? 


Theorem 6 (Behrend [3]). There is a constant y such that for every 
primitive sequence A 


1 1 1 
ae = + forn>3. (10) 
log n ee “Gee log n)2 


In the proof the general case is reduced to A’s consisting entirely of square- 
free integers and their analysis is based on Sperner’s Lemma [1]! This gave a 
strong impetus also to combinatorial extremal theory starting with [12] and 
continuing with [24], ..., [30] and many, many others. 

Theorem 6 is best possible in the sense that cannot be replaced by o(n). 


Theorem 7 (S. Pillai [8]). There exists a positive constant c, such that to 
every x <3 corresponds a primitive set A, with 
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1 1 Cc 
se 
log x 4 ai (log log x)2 


Subsequently Erdés, Sérkézy, and Szemerédi [20] showed that c can be 
chosen as (27)~2 — for any € > 0 and that this is best possible. 

The last three theorems concern in essence only finite primitive sequences. 
Related to infinite primitive sequences in the true sense is the following. 


Theorem 8 (Erdés, SArkézy, Szemerédi [21]). For every infinite prim- 


itive sequence A 
1 ( log x ) 
re: 
a ( 


oe log log «) 


dlFR 


and this bound is best possible. 


We draw attention also to a survey paper of Erdos, Sarkozy, and 
Szemerédi [22] and to a related paper of Pomerance and Sarkozy [23]. 
Concerning d A there is the following improvement of Theorem 3. 


Theorem 9 (Erdés [14]). Let A be an infinite primitive sequence, then for 
every a € A of the form a = 2%(2u+1) < n;u,v >0, 


|A(n)| <n | HGseess 1) | 


Hence, d A < $. After Besicovitch’s negative answer to Chowla’s 


question, it is natural to address the next question: Under which conditions 
on A does d M(A) or 6 M(A) exist? Davenport-Erdés and Erdés answered 
all these questions: We derive here the simplest and most transparent result, 
Theorem 10 below, in order to explain the important role of a quantity, which 
we consider to be a density concept for sets of multiples and denote as wp. 
Since A is fixed, we write M for M(A). Further we denote by My», = 
M,,(A) the set of multiples of the first m elements of A, namely a1, a2,..., 
Gm. Mm can be represented as the union of a finite number of congruence 
classes, and therefore possesses asymptotic density. If we denote by M“)(n) 
the natural numbers, not exceeding n, which are divisible by a; but not 


divisible by anyone of a,,...,@;-1, then we have 
Mn(n) = LJ M(n). (11) 
i=1 


By inclusion-exclusion for every i = 1, 2,3,..., 


wt] [etal * 3 bert 


j<t k<j<i 
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and 


(i) 1 
dues ty MO = a Bara 


n—-Co n ay 


Therefore by (11) 


dM, =S dM. (12) 


w=1 
Since 0< Dd M® <1andd M® >0, lim dMm = 2, d M® exists. 
We define now the “density” yw by 
pp A= lim d M,,(A), ACN. (13) 


m—->oo 
Since M,,,(A) C M(A), we see immediately that 
w A<d M(A). (14) 


Suppose now that )> aj! < oo. Then dd M(A) <d M,,(A)+ >> + and 


Li 


4=1 i=m+1 


thus d M(A) <p A<d M(A). 
Theorem 10 ({17]). Jf >> a;' <0, then dM(A) exists and equals WA. 
i=1 


Here are the highlights. 


Theorem 11 (Davenport-Erdés [7], also [13]). For any ACN, M(A) 
has logarithmic density and 


5M(A) =d M(A) = pA. 


Theorem 12 (Erdés [11]). A necessary and sufficient condition for d M(A) 
to exist 18 
1 
lim Ii — M(n)| =0. 15 
im limsup — S- | (n)| (15) 


E>0 noo 
ni-&<aj<n 


Even though condition (15) looks complicated, it yields a useful sufficient 
condition. 


Theorem 13 (Erdés [11]). [fA CN satisfies for some constant c, |A(n)| < 
ben forn > 2, thend M(A) exists. 

The case A C P is included here. The result is best possible in the 
following sense. 


Theorem 14 (Erdés [12]). For any monotonically increasing function W : 
N-> Rx with lim U(n) = oo there exists an AC N such that 
noo 
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|A(n)| < const for large n, 


n U(n) 
log n 
but d M(A) does not exist. 


We present now two further results with many applications. 

The first of them was probably motivated by Example 1. It shows how 
the set of multiples of certain intervals behaves in density. This is the key idea 
in Besicovitch’s construction [2]. Erdés improved the length of the intervals. 


Theorem 15 (Erdés [5]). The intervals (T1~*,T] CN satisfy 
lim a MT? -= 7) = 0) 
e—+0 
T- 00 


The second result is the famous Behrend Lemma in a dual formulation, 
that is, for X C N we use M(X) instead of N\ M(X). 


Lemma 1 (Behrend [10]). Let A,B CN be finite, then 
d M(A)-d M(B) <d (M(A)N M(B)). 


Moreover, equality holds exactly if the primitive sets P(A) and P(B) 
satisfy (a,b) = 1 for all a € P(A),b € P(B). 

Finally there are also several papers concerning the growth of ¢(a, y) [15]. 
We use later only the following result. 


Theorem 16 (Chowla and Vijayaraghavan [15]). 
_ G(x, 2°) 
lim 

x 


woo 


1 1 
= logy, Jor 5 S@ <1. 


Remark 1. We apologize for not including in our sketch several results 
of basic nature such as Rogers inequality [17] and others. Our selection is 
guided by our present research interest. The reader may consult the books by 
Halberstam and Roth [17] and Hall and Tenenbaum [19]. 


2. New Results 


We introduce a seemingly basic and new concept. 
The pair of sets (or sequences) (A,B) with A,B C N is called cross- 
primitive, if 
at band bf{a for allac A,be B. (16) 


It is convenient to denote the set of all cross-primitive pairs (A, B) with 
A,B Cc N(x) (resp. N) by Cross(x) (resp. Cross(co)). We are again interested 
in density properties. We begin with the finite case and define 


[Al - |B] 


oz) ~ (A,B) eCross(2) x? 
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Theorem 17. For all x €N, c(x) < 4 and 


Jim C(x“) = iz 


Remark 2. As analogue for a primitive sequence see the simple Example 1 


and (7). We believe that our construction is optimal for large x. Erddés thinks 


that the deviation ce max |A||B| from = is of the order x° for some 


A,B)€Cross(a 
a>. 


The infinite case shows more complex behaviour and that’s the case where 
also several classical results on primitive sequences are used. 


Theorem 18. 


max dA-dB= Z, 
(A, B)€Cross(co) 16 


The maximum is assumed for a pair with densities. 


One auxiliary result for proving this Theorem deserves special attention. 
It is an infinite form of Behrend’s Lemma 1, but by no means an easy 
extension. On the other hand, it involves the essence of the Davenport-Erd6és 
Theorem 11 and expresses it in an elegant way. 


Lemma 2. For arbitrary A,B CN 
d M(A)-d M(B) <d (M(A)N M(B)). 


We use another auxiliary result, which should be known to the experts, 
but we could not find stated in the literature. 


Lemma 3. For anyO <A <1, and any q € P, a < X, there exists a set of 
primes Q = {m1 <q2 <--:} CP with 


d M(Q) =). 
Finally, we enter the world of pathologies discovered by Besicovitch. 
Theorem 19. 


sup dA-dB=1. 
(A,B) €Cross(co) 


The supremum cannot be assumed: For A= {ay,...}, d B<1— aC 


Remark 3. The construction in the proof of this Theorem can be used to 
show that in Lemma 2d cannot be replaced by d. 
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3. Proof of Theorem 17 


Since for (A,B) € Cross(x), A and B must be disjoint and 1 ¢ AUB 


necessarily |A| + |B] < x and \Al}aI < 4. Therefore also c(x) < } for all 
xeN. 


To complete the proof, we have to construct a sequence (Az, B,)°, with 
(A;, Bz) € Cross(x) and 
Az| + |B 1 
jay ela 7 (17) 


=L—-Co a2 4 


We define for a 0, 4 <6@< 1, which we adjust later, 
Ay = {a €N: a) ® <a< a and pt(a) < 2°}, 
B, ={bEN:b <2 and pt(b) > 2°}, 


and observe that for a @ in the specified interval (A,;,B,) € Cross(x). Hence 


|A,| > a2—a'—® — |B, |. (18) 
Now Theorem 16 says that lim [al = log § and since x!~® = o(z), 
df 1 
|Bz| ~ xlog 9 |A;| = (1 — log q: (19) 


We choose now 6 such that log 4 = 1—log 4 = 4, that is, 9 = e~? ~ 0.6065 > 
5. Clearly, (19) implies now (17). 


Remark 4. A good estimate of |B.| is possible, because B, can be partitioned 
according to the biggest prime in the decomposition of its members. These 
biggest primes are essentially known in magnitude by the Prime Number 
Theorem. Our first proof followed this line. Then we learnt about [15]. 


4. Proof of Lemma 2 


Behrend’s Lemma implies that for every n € N 
d M(A(n))-d M(B(n)) <d (M(A(n)) a M(B(n))). (20) 


Since A(n), B(n) and thus also A(n) M B(n) are monotonically increasing in 
n, we have 


M(A(n))N M(B(n)) Cc M(A)N M(B) 
and therefore 
d (M(A(n)) A M(B(n))) < @ (M(A) 0 M(B). (21) 


S d (M(A(n))  M(B(n))) = d (M(A(n)) 1 M(B(n))), (20) and (21) 
imply 
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d M(A(n)) -d M(B(n)) <4 (M(A)n M(B). (22) 
Now by Theorem 11 
lim d M(A(n)) =d M(A), lim d M(B(n)) =d M(A) 


n—-+>Cco n—->co 


and therefore 


d M(A)d M(B) < d (M(A)N M(B)). 


5. Proof of Lemma 3 


For any Q = {qi < q2 <...} C P by the Prime Number Theorem (or weaker 
versions) 


|QM [1,n]| < const - 


log n- 
Therefore by Theorem 13 M(Q) possesses asymptotic density and by 
Theorem 11 

d M(Q) =4 M(Q)=Soa™, 


where 


(=2-y + > ne (23) 


jKi G5 k<j<i Gk hi 


because Q C P. Therefore 


doa =1-TT (1-2) 


jel Gi 

1 
oe % 
and for any null sequence {a;}?2, of positive numbers with )7;", a; = co any 
real number r > 0 equals ee ; @ij for a suitable subsequence {aij }721- 


and now the statement follows from )>>° = ©, because — log(1 . )> 


Ae 


6. Proof of Theorem 18 
We show first that for (A, B) € Cross(co) 
ree eee (24) 
16 


We associate with (A, B) the sets 
A* = M(A) \ (M(A) 0 M(B), 
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and observe that also (A*, B*) € Cross(oo). Moreover, we notice that 


Ac A* and BC B* (25) 
and that 
M(A)N M(B) = M(C), (26) 
where 
C = {(a,b):a€ A,be Bh. (27) 
By our definitions and properties (25) and (26) we have also 
An [1,2] C (M(A) 9 [1,2]) \ (W(C) 9 [1, z]) (28) 
and therefore 
[An [1,2] < |M(A) 9 [1 a]] — |M(C) 9 [1,2]. (29) 


Let now (2;)%2, be an increasing sequence of positive integers with 


=d M(A). (30) 
I oo Hort 
Then by (29) and (30) 
aj Acti Od 2a i ee OO el S ae EO) 
i-oo XG i oo Ly 


(31) 
Now we lower bound d M(C) by Lemma 2: 
d M(C) =d (M(A)N M(B) > d M(A)-d M(B) 
and conclude that 


d A<d M(A)~d M(A)-d M(B)=da M(A)(1-d M(B). (32) 


Symmetrically 
d B <d M(B)(1—d M(A)) (33) 
and thus finally 
d Ad B<d M(A\(1—d M(A))-d M(B)(1-d M(B) < 5-5 = 
We construct now (A, B) € Cross(oco) with 
dvod Be 4 (34) 


By Lemma 3 there is a @ = {qi C q2...} C P with 


d M(Q) = ; and q, > 2. (35) 
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Set 
A={aEN:2|aandq;{ for all g € Q}, 
B={bEN:2{band bE M(Q)}. 
Equivalently 
A = M({2}) \ (M({2}) 9 M(Q)) and B = M(Q) \ (M({2}) 9 MQ). 
Also, it is clear that (A, B) € Cross(oo) and that M({2})N M(Q) = M(C), 
where 
C = {2qi: a € Qh. 


Obviously |C'M [1,n]| < const;=2; and again by Theorem 13 M(C) has 


asymptotic density and is given by the formula d M(C) = >> q?, where 
i=1 


AO) _ 1 1 1 _ gq? 
: 2d: ra 205% 20h 2 


k<j<i 
Hence d M(C) =$ > q = d M(Q) = +. Therefore 


d A=d M({2})-d(C) = 4- 


1 
= 5-7 = pd B=4dM(Q)-d(C)= 


1 
4 
and (34) holds. 


7. Proof of Theorem 19 


Let us fix 6 > 0,6; > 0 forte N, 956; =6 andO<O<1. 
i=l 
By Theorem 15 for a 6; > 0 there are positive numbers T'(6;) and A(d;) 
such that 
d M((T’-*,T]) < 6; for T > T(d:),r4 < A(6;). (36) 


We fix arbitrary A; € (0,A(0;)) for i € N and A* € (0,1). Now (36) and 
the definition of density tell us that for T; > T(d;) (jf = 1,2,...,7) and 
S > S(A*,T), To,.--, Ti, A1,---,i, 61,---,6;) (suitable) simultaneously 
1—aAj = \* 
IM((T; “’, T)) N[S*-*’, SI] 
S — Si-*”* 
Now let R, be an integer with the properties 


<6; for j <i. (37) 


1 
Ry > T (61) and = <é. (38) 
Ry" 


We fix the interval [R{~*', Ri]. Let L, be an integer with the properties 


130 Rudolf Ahlswede and Levan H. Khachatrian 


ia" Ss. Ry and. Lt SB Oa, Bis Anois (39) 

We choose the interval [L}~™*, L,]. Furthermore, we choose Rg, Lz such 

that Ly < i, Ro < in, ie > max{T (62), S(Ao, In, 1, o1)}, ou < 

2 

6 and i > S(A2, Ri, Re, A1, A2, 01, 62). We fix now intervals [R3-?, Re] 

and [L}~*?, Ly]. Continuing this procedure, for every i¢ N we choose Rj, L; 

such that Lj-1 < Teese ue ai)? Ae on aS max{T'(0;), S(Ai, L1,..., Li-a, 

Misives Mis Oiges espa) ye Ri < Q, etre as > S04, Ri,.--, Ri, A1,-- +5 Ai; 

6,.--,6;). We fix intervals [R}~*‘, Ri] and [L}~*‘, Li]. By our construction 
one has for every i € N 


|IM([R;~*, Ry]) M[Li-™, Lill 


$= (Ek < 6; for all j <i (40) 
and analogously 
\M(IL;-™, Lj]) 0 [R™, Rill oe 
Ro Re < 6; for all j <i-1. (41) 
Now we introduce (disjoint) sets 
A* = Bl bramaeey ieee = | Jie Ea] (42) 
i=l i=1 
and consider the sets 
A= A* \ M(B"), B = B*\ M(A*). (43) 


It is clear from this definition that (A,B) € Cross(co). The upper densities 
dA and d B are lower bounded now with the help of (40) and (41). For every 
i € N the number of integers in A, which do not exceed R; is at least 


[Ri-™, Ri] \ (M(B*) 0 [Ri-™, Ril)| 


SSR) - Sim , lg), BS) 
i-1 
> (Ri — Ri) — (Ry — RE) 505; > (Ri - BE™ YA — 4). 
j=l 
1=rj 
Therefore, for every i > 1, any = aces (1 —6) = (1- - zs) 


(1— 6) > (1— @)(1 — 6), because mr < @ for all i € N. Hence, d A > 
(1 — 6)(1 — 6). Similarly d B > (1 — 6)(1— 6) and therefore 
d A-d B> (1—6)?(1—6)?. 


The result follows, because 6 and 6 can be made arbitrarily small. 
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8. Concluding Remarks 


The notion of cross-primitive pairs can be generalized to that of cross- 
primitive k-tuples of sets (A1,..., Ax). The understanding here is that any 
pair (A;, A;) (i € 7) is cross-primitive. Cross(x) then becomes Cross; (x). We 
guess that 


1. lim max kA = (1)k 
Z—+00 (Aq,..., An) €Crossp (x) te tne (x) 

2. a kd Aj — (4)¥ (EE) 
is ce (x) ( k ) 

3 sup Ik, d Ai = 1. 


, (A1,...,Ax)€Crossz (oo) 
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Summary. We show that if a set B of positive integers has positive upper density, 
then its difference set D(B) has extremely rich combinatorial structure, both 
additively and multiplicatively. If on the other hand only the density of D(B) 
rather than B is assumed to be positive, one is not guaranteed any multiplicative 
structure at all and is guaranteed only a modest amount of additive structure. 


1. Introduction 


Given a subset B of the set N of positive integers, denote by D(B) its 
“difference set”. That is D(B) = {w-—y: 2,y € Banda > y}. We are 
concerned here with difference sets which are “large” in one of two senses. 
That is, we ask either that d(B) > 0 or that d(D(B)) > 0 where 


d(A) = limsup JAN 41,2, cn} lfm. 
n—->Cco 


We show in Sect.2 that if d(B) > 0, then D(B) has an incredibly rich 
algebraic structure. We show for example that given any function f : N— N, 
there must exist a sequence (rp)?2., so that {}),¢pGn-@n : F is a finite 
nonempty subset of N and for each n € Fil < an < f(n)}U {[],ep 00": F 
is a finite nonempty subset of N and for each n € Fi 1 <a, < f(n)} C D(B). 
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With no sort of largeness assumptions at all (beyond the requirement that 
B should have at least three members) one must always be able to get some 
a and b with {a,b,a+b} C D(B). (Given « < y < z in B, leta=y-—~z and 
b = z—y.) Infiniteness by itself doesn’t help much. Indeed, it is easy to see that 
if B = {2”: n € N}, then for no a, b, and cis {a, b,c, a+b, a+c, b+c, a+b+c} C 
D(B). On the other hand, we show in Sect.3 that if d(D(B)) > 0, one can 
always find a, b, and c with {a,b,c,a+b,a+c,b+c,a+b+c} C D(B). 

We have not been able to determine whether D(B) (where d(D(B)) > 0) 
must contain some 4 elements with all of their sums. However, we do show 
in Sect.3 that one can find sets B with d(D(B)) arbitrarily close to 1/2 such 
that D(B) contains no five elements and all of their sums. We also show that 
we can find sets B with d(D(B)) arbitrarily close to 1 such that D(B) does 
not contain any {a,b,a- b}. 


2. The Difference Set of a Set of Positive Density 

We show here that if d(B) > 0, then D(B) has a rich additive and 
multiplicative structure. Many of the results in this section are from the 
dissertation of the first author [2]. We begin by stating a well known result 
about sets of positive upper density, whose proof we leave as an exercise. 


Lemma 1. Let A CN such that d(A) > 0 and let k © N such that 1/k < 
d(A). Then given any t,,t2,...,t, in N there exist some i <j in {1,2,...,k} 
with d((A — ti) N(A—t;)) > 0. 

Note by way of contrast that it is easy to get two disjoint sets both with 
upper density equal to 1. It is an immediate consequence of Lemma 1 that if 
d(B) > 0, then D(B) is an IP*-set. That is, given any sequence (a,,)°, in 
N there is some finite nonempty subset F' of N such that ee pia] DB). 
(To see this: for each i, let a; = )>),_, Un and pick i < j such that d((B— 
a;)(B—a;)) > 0. Then >>) _,,, tn € D(B).) Therefore, by Bergelson and 
Hindman [4, Theorem 2.6] there is some sequence (77,)?°_, with {)),¢p tn: F 
is a finite nonempty subset of N}U{]],,cp 2n : F is a finite nonempty subset 
of N} C D(B). We show in Theorem 5 below that a stronger conclusion holds, 
(without invoking any results from [4]). 

We shall utilize in our proofs two results from ergodic theory. The first of 
these is Furstenberg’s famous correspondence principle which was first used 
in his proof of Szemerédi’s Theorem [6, 7]. Recall that a measure preserving 
system is a quadruple (X,6,y,T) where X is a nonempty set, B is a o- 
algebra of subsets of X, yw is a nonnegative o-additive measure defined on B 
with u(X) = 1 (so that (X, B, 1) is a probability measure space), and T is an 
invertible measure preserving transformation of X. (That is, T is continuous, 
and whenever B € B, T~'B € B and pp(T~'B) = p/(B).) 
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Theorem 1 (Bergelson [1, Proposition 2.1] and Furstenberg [6, 
Theorem 1.1]). Let B CN with d(B) > 0. There exist a measure preserving 
system (X,B,,T) anda set A € B such that (A) = d(B) and for alln EN, 
d(BN(B—n)) > p(ANT™A). 

Given measure preserving systems (X1, 61, p41, 71) and (X2, Bo, 2, T2) we 
follow standard practice and denote by (X1 x X2, Bi x Bo, pn X p2,Ti x T2) 
the system where X, x X92 is the cartesian product, B, x Bz is the o-algebra 
generated by sets of the form A, x A» for A; € By and Ag € Ba, p11 X pe 
is the measure on B, x By determined by (jy X fu2)(A1 X Az) = pi(A1) - 
}tg(A2) and T, x T2 is the measure preserving transformation defined by 


(T, x Ta)((1,2)) = (T1(#1), T2(x2)). 


Theorem 2. Let T,,72,...,T% be invertible commuting transformations of 
a probability measure space (X, B, 1). Assume that p1(n), po(n),..., pe(n) are 
polynomials with integer coefficients such that p;(0) = 0 fori € {1,2,...,k}. 
Let A € B with p(A) > 0. Then there exists n € N such that u(AN 
qe gree AY iy, 


Proof. This is exactly [3, Theorem 4.2] except that the conclusion there has 
n € Z\ {0}. To derive this version we utilize the product space (X x X,B x 
By xX wu). For i € {1,2,...,k}, let S; = T; x +, where z is the identity. 
For i € {K+1,k+2,..., 2k}, let S; =cxT;_; and let p;(n) = pj_.(—n). Then 
S1,S2,..., So, are invertible commuting transformations of (X x X, Bx B, x 
js) and (wu x ys)(A x A) > 0 so pick (using [3, Theorem 4.2]) n € Z \ {0} such 
that (ux p)((Ax A)SPL™ gP2™) _ gP2k") (4x A)) > 0. Ifn > 0 we see from 
the first coordinate that p(AN TP 7?2™ |, TP«™ A) > 0. If n <0 we see 
from the second coordinate that p(AQ TO" T22—) . pe) A) = iy 


We shall see in Theorem 5 that whenever d(B) > 0, D(B) contains sums 
and products from a sequence where terms are allowed to repeat a restricted 
number of times. We present first a special case so we may introduce the 
proof in a relatively uncomplicated setting. 


Theorem 3. Let B C N with d(B) > 0. Then there is some sequence 
(tn)? such that {ocr antn : F is a finite nonempty subset of N and 
for eachn€ F, an € {1,2}} U{[],ep ut” : F is a finite nonempty subset of 
N and for each n € F, ay € {1,2}} C D(B). 


Proof. Pick by Theorem 1 a measure preserving system (X, 5, yu, T) and some 
A € B such that (A) = d(B) and for each n € N, d(BN(B-n)) > 
w(AMT"A). Observe that {n EN: w(ANT"A) > 0} C D(B). For m € 
N and a sequence (7,)7_, in N let E((an)P_1) = {ener antn: F isa 
nonempty subset of {1,2,...,m} and for each n € F,a, € {1,2}} and let 


C((tn)h1) = (ler 2" : F is a nonempty subset of {1,2,...,m} and for 
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each n € Fyay, € {1,2}}. We construct a sequence (%)°°, by induction so 
that for each m, E((ap,)r_, UC ((an)h™1) C {n EN: w(ANT”A) > 0} which 
will suffice by our observation. 

To ground the induction consider the measure space (X x X x X,B x Bx 
B, ux px p), let S$; = (T xext), $2 = (ex T xt), $3 = (&xexT), pi(n) =n, 
p(n) = 2n, and p3(n) = n?. (Recall that v is the identity.) Pick by Theorem 2 
some x; € N such that (ux wx p)((A x Ax A)N penton) paler) grater A x 
Ax A)) > 0. From the first coordinate we see that (AM Ty"! A) > 0, from 
the second coordinate we see that (AM T?"'A) > 0, and from the third 


coordinate we see that u(AN TA) > 0. Since E({a,)1_,) = {a1, 221} and 
C((an)i_1) = {21,27}, the grounding is complete. 

Now let m € N be given and assume we have chosen (a,)"2;' with 
E((an)™!) U C((tny™2)) C {n EN: p(AN TA) > 0}. Let b = 37-1 


and enumerate (with repetitions if need be) {0} U E((an)"2') as (y; ys , and 
enumerate {1} U C((ap,)7"') as (z;)° ;=1: Now consider the measure space 


n=1 
ve ee ,B, X92). Let H = Ca lee x (ANT A) x Ax A), let 
= xX? 2p, and nabs that fi(H) > 0. (Our induction hypothesis tells us that 


a ji( ANT A) > 0.) Let $1 = X8_,(Txexexe), So = Xb_y (UxTXixe), 93 = 
XP_1(ux ux T*s xv), and S4 = XP _ (Lx uxexT*?). Let pi(n) = n, po(n) = 2n, 
p3(n) = n , and p4(n) = n?. Pick by Theorem 1, some 2, € N such that 
[i(H 7 SP rm) gP2(@m) ops(em) cpalem) Fr) >, 9, 

To see that E((tn)"™,) C {n © N: w(ANT™A) > O}, let 0 F 
FC {1,2,...,m} and for each n € F, let a, € {1,2}. If m € F, then 
diner Onin € E((an))™='), so we assume m € F. Pick j € {1,2,...,b} 
such that )),,¢p@n2n = Yj + @m%m. If am = 1, we see by looking at 
coordinate 47 — 3 that u(ANTY%ANMT*™(AN T%A)) > 0; in particular 
WAN TY%+*™ A) > 0. If am = 2, we see by looking at coordinate 4j — 2 that 
w(ANT%AN T22™(A TY A)) > 0; in particular u(A NM T%+2%™ A) > 0. 

To see that C((an)™,) C {n € N: p(AN TA) > 0}, let 0 F 


FC {1,2 ...,m} and for each n € F, let a, € {1,2}. If m ¢ F , then 
Ilner 22 me O(n) ~'), so we assume m € F. Pick j € {1,2,...,b} such 
that Ter 22 an = 2,7: a9". If am = 1, we see by looking at pesrdinate 


4b — 1 that (AN (T*)®™A) > 0 so that p(ANT*"A) > 0. If am = 2, 
we see by looking at coordinate 4b that u(AN (T?3)®m A) > 0 so that 
WAN T2EmA) > 0, 


We observe in fact that if one has sets B,, Bo,...,B, with each d(B;) > 0, 
then the conclusion of Theorem 3 applies to ()j_, D(B;). To see this one 
simply starts with the product system (X7_, Xj, X?. Bi, XP_1 Mi, XP, T;) where 
(X;, Bi, ui,T;) is the system given by Theorem 1 for B;. 

Recall that a set B CN is an IP* set if and only if whenever (r,)°°, isa 
sequence in N, one has F'S((x,)°,) NB #0. We pause now to sieeve: that 
neither of the conclusions of Theorem 3 follow from the fact that D(B) is an 
IP* set. 
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Theorem 4. There is an IP* set A such that for no sequence (a)°L, ts 
{ener antn : F is a finite nonempty subset of N and for each n € N, 
dn € {1,2}} C A and for no sequence (Yn)? 8 {]]nep yn” : F is a finite 
nonempty subset of N and for each n € N,an € {1,2}} C A. 


Proof. Let B = N \ {x? : x € N}. Since one clearly cannot get any sequence 
(tn), with {0 ,,¢p £n : F isa finite nonempty subset of N} C {2 x EN}, 
one has that B is an IP* set. And no sequence (yn)°, has any y? € B. 
Now by Deuber et al. [5, Theorem 3.14], there is a partition N = C; UC) 
such that for no sequence ()?21 is {}o ep £n : F isa finite nonempty subset 
of N} C C; and for no sequence (Yn)%1 is {Tne r, Yn + Done, 2Yn : Fi, and 
F» are finite nonempty subsets of N and max F, < min Fh} C Cg. Then C2 
is an IP* set. Let A = BN C3. Since the intersection of two IP* sets is again 
an IP* set (see [4]), we have that A is as required. 


The next theorem is our major result of this section. Considerably 
stronger statements are in fact available with the same proof. However, we 
are trying to keep the results easily comprehensible. 


Theorem 5. Let B CN with d(B) > 0 and let f : N + N. Then there is 
some sequence (an)P-_, such that {)) cp 4ntn : F is a finite nonempty subset 
of N and for eachn € F,a, € {1,2,..., f(s} U{]],<r an” : F is a finite 
nonempty subset of N and for each n € F, ay € {1,2,..., f(n))} C D(B). 


Proof. We describe how to modify the proof of Theorem 3. First define 
E((an)ir_1) and C((an)r 1) analogously. At the grounding level one takes 
the measure space (x2#@) a ee ca "B, ea ‘). One lets P;(n) = i-n 
for i : {1,2,...,f(1)} and lets pie =a “F(1)=1 fori € {f(I)+1,f) + 
S41), 

AG oie caso stage, one lets b = [["7'(f(i) + 1) and enumerates 
E((tn)™") U {0} as (yi) , and enumerates {1} U C((an)™"7') as (z;)° 


j=l 
Then one uses i measure space (xp2 rile’ g xe aI ons 2-f(™) 1), and lets 


= XxX (An THA iay . Using ‘the ‘a definitions of 
j=l as i=1 
a S2,-.-,So.f(m) and pi, p2,---,P2.f(m) one completes the proof. 


3. Additive Structure in Dense Difference Sets 


For the remainder of the paper we look at difference sets D(B) where we no 
longer require that d(B) > 0, but only that d(D(B)) > 0. 

Because difference sets are defined additively one would not necessarily 
expect them to have any multiplicative structure. On the other hand, 
Theorem 5 might make one suspect that they would have some multiplicative 
structure. We begin this section by showing that they need not. 
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Theorem 6. Let « > 0. There is a set B such that d(D(B)) > 1—€ and 
there do not exist a and b in N with {a,b,a-b} C D(B). 


Proof. Pick a € N such that 1/2% < e. Define a sequence (f(r))2.9 by f(0) = 
2+aand f(r+1) = 2(f(r)+a)+1. Let (an)°, enumerate Ur {27 2h) + 
jee ie ae 1} in increasing order and note that for alln in N, a, < Qf(r), 
Let B = {2 :n ENPU {27 42, : n © N}. Then D(B) = {an : n € 
N}U {2f( + ge, — 260) > m,n € N and m < n} U {2f™ — 2f™ :mneN 
and m < n}U {2f™ 42, — 2/0) — a2, :m,n © N and m < n}U {2f™ — 
2f(m) _ x, :m,n € N and m < n}. Now given any r € N we have |{r, :n € 
N} (1, 2,...,27+%}| > af +e — of@) so d(D(B)) > 1-1/2? >1-e. 
If a = an, then for some r € NU {0}, we have 2f( <a < 2f(r)+@ Tf 
a € D(B)\ {xp :n € N} then there exist m and n in N with m < n such that 
Qf) of _— 7 <a < FM 4, —2F(™, Since 2f(™) —2f(™ gy, > Qf(r)-1 
we conclude that for any a € D(B) there is some n € NU{0} with 2f()-1 < 
a < 2/()+2_ Now suppose we have a < b in D(B) such that a-b € D(B). 
Pick m < n <r in NU {0} such that 2/0)-1 < a < 2flm+te, gf(M—-1 < 
b < 2f)+e and 2F)-1 < a-b < 2+ If n < r we have 2f(-1 < 
ab < WOM+H M429 50 f(r) < flm) + f(n) + 2a < 2f(n) + 2a < f(r), a 
contradiction. Thus n = r so that 2f(™M+f(™)-2 < q.b < Qhr)te — of(rite, 
Then f(m) <a+2= f(0) < f(m), a contradiction. 


Theorem 7. Let B CN and assume d(D(B)) > 0. There exist a, b, c in N 
such that {a,b,c,a+b,a+c,b+c,a+b+c} C D(B). 


Proof. If d(B) > 0 we are done by Theorem 5 so we assume d(B) = 0. 
Enumerate B in order as (x,)°2,. The result of this theorem is almost free. 
That is given any r >s >n> k, if we let a = 2, — 25, b = Xs — Xn, and 
C=, —X,, thena+b=2,—-—a%,b+c=2,—a2,, anda+b+c=2, — Zp. 
The only problem then is to find r > s >n > k such that 7,—25+2%n—2rE € 
D(B). 

Let a = d(D(B)) and pick | € N such that 1/1 < a. For each t we have 
d(B —t) = d(B) = 0. Let E = D(B)U,_,(B — 2x). Then E = {2,— 2, : 


r,s €Nandr>s > 1} and d(£) =a. Pick by Lemma 1 some k < n <1 
such that d((E —2,)N(£ —2,)) > 0. In particular (EF — rp) (E-—2n) £0 
so pick r>s >landt>m>Isuch that x,—2,—2@, = 21 —Lm — Xn. Then 


r>s>l>n>kanda,—2%7,+2%n— CE = 4 — Lm as required. 


We now set out to show that we can produce sets B with d(D(B)) 
arbitrarily close to 1/2 such that D(B) does not contain F'S((a,,)?_,) for any 


1, 42,43, 04, @5. (Here FS({an)P_1) = {donep an : OA F C {1,2,...,m}}.) 
We first introduce the sets B (whose dependence on a is suppressed). 


Definition 1. Fira € N with a > 4. Let (ap)°, enumerate in increasing 
order (NA+ 1) 71 (UF (207, 20? +13. 2 OF 1). ta B= fo: 
neEN}U {20% 4+ a, :n€ N}. 
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One sees immediately that one can get a1, a2, a3, and a4 with F'S((an)4_,) 
C D(B). Indeed let s < m be given, pick | and r such that 2°°*? < 2 < 
LEF QM — I << Q°THO—? Det ay, = a + 2°™ — 2°%, and pick v and t such that 
ott? < a < ay 20% — 2° < 20+0-2. Then let ay = 2° — 2° = x, — 21, 
Ao = 2" = 2", da = 2" =O. arid a4 = ey. Then PS((a,)> 4) © DB). 
In fact, one can show that any sequence of length 4 with its sums contained 
in D(B) must fit this description. The computations are longer and more 


painful than those on which we are embarking, so we omit them. 


Definition 2. Let a and (x»,)°°_, be as in Definition 1. Then Ay = {am :n € 
N}, Ag = {20% + ey — 2°": n,m EN and m < n}, Ag = {297 — 2° — ayy, : 
n,m €N andm < n}, Ag = {20% — 2° 2 nym € N and m < n}, and 
As = {20% + @y — 2°" —am:n,m EN andm <n}. 

Observe that D(B) = Gian Aj. 


We next prove two lemmas to aid in our computations. 


Lemma 2. Let n1,n2,7™m1,m2 € N and let v1, 72, d1, 62 € {0,1} with ng > 
m1 and mg > my. If 2°"? +2°™! + yan, + YV1Ln, = 20 + 2°! + botm, 4 
b1%m,, then 


(1) (na, m1, 72,71) = (M2, m1, 62, 61) or 
(2) ng = ny and (nz, 71, 72,71) = (me, m1, 61, 52). 


Proof. We assume without loss of generality that ng > mg. If we had ng > m2 
we would have 2072 + 2°! + d0%m, +61%m, < 4:20 = Qomat? < game < 
2072 + 29°™ 4 ro%n,. + Y1Ln,, a contradiction. Thus np = m2. Assume first 
that y2 4 d2 and assume without loss of generality that yo = 1 and 62 = 0, 
Then tp. = 2°! — 2° 4 d1%m, — Y1Ln,. We claim m, = n4. If we had 
mi <n, we would have tp. << 2°! — 2°"! + dyam, < 2-20) — 2° < 0. 
Suppose now m,; > n,. Then tp, < 2° + 64¢m, < 2%™ +! and gn, > 
QO DE ey ene De 2a > 20-1 But for some r we have 
Qort? < ay < 2°7+9-2 a contradiction. Thus m; = n1 80 Zn, = (61-1) En, 
and hence 6; = 1,7, = 0 and n, = ng so that conclusion (2) holds. 

Now assume 72 = 62. Then we have 2°! + yn, = 2° + 61m ,. As in 
the first paragraph we see n, = m 1 so y1N1 = 61N1 SO 1 = O14. 


Lemma 3. Let n1,n2,73,7™m1,m2,m3 € N and let 71, 72, ¥3, 61, 62, 63 € 
{0,1} with ng > no >n1 and m3 > m2 > my,. Assume 2°73 4+ 2072 4 QO 4 
3 ng “a 2 no a Wen, = Qe ae Dee 2°m4 oe b32mz = 622m am b12m,- 
Then some one of the following conclusions holds. In any event we have 
v1 +72+73 = 61 +62 + 63 and max{n1, n2,n3} = max{m,m2,m3}. 


(1) (ng, M2, 1, 35 V2, 71) = (m3, m2, m1, 53, 62, 61) 

(2) no = nz and (ng, 2,1, 73,2; 71) = (M3, M2, M1, 63, 62, 61) 

(3) ng = ng and (ng, 2,1, 73; 2,71) = (M3, M2, M1, 02, 63, 1) 

(4) nz =N2 =n, and (n3, 22,71, 35 72) 71) = (m3, m2, m1, 41, 62, 63) 
(5) (ng, N2, 73,2) 71) = (m3, M2, 63, b2,61) and 73 A y2 and ny #A my. 
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Proof. We assume without loss of generality that n3 > m3, If we had n3 > m3 
we would have 2°78 + 202 + 2971 + $3%mz, + 02%m, + 612m, < 6:20 < 
2QOMS OMS 4 2OM2 4 90M 4 ver. + Y20n, + 12n,, a contradiction. Thus we 
must have n3 = m3. If also y3 = 63 we have 2°"? + 20! + yan. +912n,; = 
202 4. 2° Jo am +01%m, 80 Lemma 2 applies and yields conclusion (1) or 
conclusion (2). 

Thus we assume 73 4 63 and assume without loss of generality that 
v3 = 1 and 63 = 0. Then x,, = 202 — 202 4 20™1 — 2°™ 4 doamg — 
Y2En_ + 612M — 712n,. We observe that if we had mz < ng we would have 
Ing < 4-202 — Qen2 <Q, 

Consequently mz > n2. We claim in fact mz = n2 so suppose instead that 
ms > tig. Then, < 4:20 = 202t? and a,,, > 202 — 4.202 > Qoma-t, 
But there is some r € N such that 2°°t? < rp, < 2°7+°~?, a contradiction. 
Thus mz = nz as claimed. Consequently we have z,,, = 207! — 207 + (2 — 
Y2)€nq a b12my —%71en,- 


Case 1. 62 = 2. Then we have rn, = 2°71 — 2°"! +62%m, —J14n,. Reasoning 
as above we conclude m; = ny. Then ¢n, = (61-71) tn, $0 61 = 1, 71 = 0, 
and n3 = ,. Then conclusion (4) holds. 

Case 2. b2 4 72. We claim that we must have 62 = 1 and 72 = 0. To see 
this suppose instead 62 = 0 and y = 1. Then x, = 2°? — 20% — 
Ln. + 612m, — Vin, One cannot have n; > m, for then one would have 
In, < 0. If we had ny = m we would have tp, = —Xn, + (d1 — V1) an,- 
Since z,, > 0 one would have to have 6; = 1 and 7, = 0. But then one 
would have 2, + %n, = %p, forcing zp, to be even. Thus one must have 
ny<my. 


Now we claim that x, > 2°1—1. Suppose instead that rn, < 20171, 
Now tng 222°™ and fordomée 72"? < 2, <2?" soa, <2, 
That is 2° — 2° —9,, + 1am, —V12n, < 2°? so 2°™ + byam, < 
gam—2 + gan, + Di, + M24 < gam—2 + gami—2 + gami—1 = 20m | a 
contradiction. Thus we have zp, > 201~1. 

But now for some s we have 2+? < ay. < 2+? g0 ay, > 2042, 
But then we have tp, = 2°! — 2°™ — ay, + 610m, — YiEn, < 2° + 
b1fm, — 201 +2 < 0, a contradiction. Thus we have established that 52 = 1 
and 72 = 0. 

Then we have that t,, = 20 — 2°"! + an, + 612m, — Y1Ln,. Since 
Lng,Lmo,Lm,, and Lp, are all odd we conclude 6; = 7. If also m; = n1 we 
conclude that tn, = Lm. SO N3 = M2 = Nz and conclusion (3) holds. Thus 
we assume m1 # 71. In this case conclusion (5) holds. 


We now begin an embarrassingly long sequence of computational lemmas. 


Lemma 4. Ifa,b € Ay U Ao thena+b¢ D(B). 
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Proof. Suppose a,b € A;U Az anda+b € D(B). Then a+b is even soa+beé€ 
Aq U As. Pick s <r and 6 € {0,1} such that a + 6 = 207 — 2% 4 (a, — 2). 
We consider three cases. 


Case 1. a,b € Ay. Pick n,m € N such that a = x, and b = z,,. Then 
Ln +m + 2% + dx, = 2° + dx, so adding 2°" + 2°” to both sides we 
get by Lemma 3 that 1+ 1+ 6 =, a contradiction. 

Case 2. a,b € Ag. Pickm < nandl < k such that a = 2°"%+2,,—2°™” and b= 
20k +e, —2%, Then 2°" + 2, —2° 4.2% 4.4, 2% = 207 98 4 §(x,—25) 
so 207 1 90h 4 908 Lo top tho, = 20° +20 190 4 So so by Lemma $, 
1+1+6=0, a contradiction. 

Case 3. Not case 1 or case 2. Without loss of generality a € A; and b € Ag. 
Pick n such that a = x, and pick | < k such that b = 20k 4 7, —2% Then 
Ln +2°* + 2, —2% = 2° _2°° 4 5 (x, — a5) So we again get a contradiction 
using Lemma 3. 


Lemma 5. If a,b € Az, thena+b¢ D(B). 


Proof. Pick n > m and k > I such that a = 2°" — 2° — x, and b = 2°* — 
2” _ 7. Suppose a+b € D(B), in which case since it is even, a+b € AqgUAs. 
Pick 6 € {0,1} and s < r such that a + b = 2°" — 2°% + d(x, — x,). Then 
gon 4 90k 4 9084 §¢, = 20r 4+90m 4 Ql 4 7 4-4, +62, so that by Lemma 3, 
6=1+41+ 6, a contradiction. 


Lemma 6. Let m <n andl < k be given and let a = 2°” — 2°" — az, and 
b= 2% _ 2% Tfa+be D(B), thenl=n. 


Proof. Since a+ b is odd we have a+b € A; ora+b€ Ap ora+b€ A3. 
We show first that the first two possibilities cannot hold. Indeed if we had 
a+b € Aj, then for some r, 2°" — 2° — a, + 2°* — 2% = a, so that 
gon 4 gak 4 gar _ gam 4 gal 4 9ar4 4 4, so that by Lemma 3, 1+1=0. 
A similar contradiction is obtained from the assumption that a+b € Ag. 
Thus we may pick s < r such that a+ b = 2% — 2% — ,. Then 2% + 20% 4 
2°S +7, = 207 420m +9 + x. By Lemma 3 we have that max{n,k, s} = 
max{r,m,l}. Since 1 < k < max{n,k,s} we have 1 4 max{r,m,1}. Similarly 
m # max{r,m,l} and s 4 max{n,k,s}. Thus max{r,m,/} =r. Assume first 
k <n. Then n = max{n, k,s} son =r so 2% 42% +, = 207 49% +o. By 
Lemma 2 we have max{k, s} = max{m, 1}. Since 1 < k we havel 4 max{m, |} 
so 1 <_m so conclusion (2) of Lemma 2 cannot hold. 

If we had k < s we would have (m,1) = (s,k), while 1 < k. Thus s <k 
so (k,s,0,1) = (m,1,1,0), a contradiction. Thus we have n < k so that 
k = max{n,k,s} and hence k = r. Then 2°” + 2° +7, = 2° 42% 4 ay. 

By Lemma 2 max{n,s} = max{m,l}. Since m < n we have m # 
max{m, 1} so (l,m) = (n,s) or (l,m) = (s,n). The latter is impossible since 
m <n so in particular n = I. 
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Lemma 7. Let 1 < k andm < nin N be given with k > n and let p, 
7 € {0,1}. Let a = 20* —2% 4. r(4,, —2) and let b = 2° —2°" + pi(an —tm) 
and assume that a+b € D(B). Then some one of the following holds: 


(1) n=l and wp =7; 

(2) n=1 and 4 =0 andr =1 and there is some v < m such that x, — 2 = 
poe — 20m — 20” 

(3) n<l and w=1 andr =0 and there is some v > m such that x, — 2) = 
2°" 4 ry — Im; ifn <l, thenv =n; or 

(4) n<land w=r=0 and x, — x = 2° — 20”, 


Proof. Since a + 6 is even we must have a+b € A, U As. So pick r > 
s in N and v € {0,1} such that a+b = 2°7 — 2° + v(x, — x,). Then 
gok 490m 4 908 4 ra, 4 PEn + VL, = 2° + Qe 490M yg, 4+ ray + LLm.- 
By Lemma 3, max{k,n,s} = max{r,l,m}. Sincel << k,m <n, and s <r we 
have max{r,l,m}=r and s 4 max{k,n,s}. Since k > n,k = max{k, n, s}. 


Case 1. n> s. If we had m > 1 we would then havek >n>sandr>m>l 
so that by Lemma 3 we would have (k,n) = (r,m) while m < n. Thus 
1>m. We then have k > n> s andr >I1>mso by Lemma 3 some one 
of the following holds: 


(a) (k,n,s,7, u,v) = (r,l,m,v,7, 1), 

(b) n=s and (k,n,8,7,p,v) = (r,l,m,v, U,7), 

(c) k=n and (k,n, 8,7, u,v) = (r,1,m,7,v, 1), 

(d) k=n=s and (k,n,8,7, u,v) = (r,1,m, ,T7,V), or 

(e) (k,n,7, u,v) = (r,1,7,v,), andr 4p ands 4m. 

If 44 = 7 we have that conclusion (1) of the current lemma holds. So assume 

pp #7. This eliminates (a) and (d) above. The fact that m <I eliminates 

(b) above. The fact that | < k eliminates (c) above. Thus we have (e) must 

hold. Observe also that 7 4 v. (If so one would have 20°°+2°+pa,+v2, = 

Qe 4 90M 4 rey + LLmp, So that by Lemma 2 one would have m = s, which 

is forbidden by (e).) 

There are thus two possibilities. First one could have wy = v = 0 and7 = 1. 

In this case 2°° + a, = 2°” + a, so 2° — 2% =a,-2, >0s0s<m 

and conclusion (2) of the current lemma holds. Second one could have 

f=v=1andrT=0. In this case 2° + 4 +4, = 2°" + 2, + 2m So that 

LE — @ = 2% — 2°" 4 x, — x, and conclusion (3) of the current lemma 

holds. 

Case 2. n < s. Since s < r= k we have then k > s > n. By Lemma 3 we 
then have that (k,s) = (r,l) or (k,s) = (r,m). Since m <n, the latter 
alternative is impossible and hence m < |. Also! < k =r so we have r > 
1 >m. Since n 4 m we have only one possibility from Lemma 3, namely 
that (k,s,7,v,~) = (r,1,7,v, 4) and rt # v. Since k = r and s = 1 we then 
have 2°°4+7a,+pty,tvax, = 2°"+vaezpt+rarit+urm. Suppose T = 1. Then we 
have vy = 0 so &% — a = 2° — 2° + U(&m — Ln) < 0, which is impossible. 
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Thus + = 0 and v = 1 and hence ay — a = 2%" — 2°" 4+ plan — 2m). 
If w = 1 this gives conclusion (3) of the current lemma while if 4 = 0 it 
gives conclusion (4). 


Lemma 8. Assume a > b > c and {a,b,c} C AyU As and {a + b,a + 
c,b+c,a+b+c} C D(B). Then there existk >1>m->s inN such that 
a = 20% — gal ph — gel __ 90m and ¢ = 20™ — 2°S = x, — 2. 


Proof. Since a, b, and c are in Ay U As we have k > 1,n >m, andr >s in N 
and 7, p, v in {0,1} such that a = 2%* — 2% + r(x, — 2;),b = 20" — 207 + 
[b+ (pn — Lm) and c = 2°7 — 2° +1 - (x, — as). Since a > b > c we have 
k>n>r. Applying Lemma 7 to a+ b we have one of: 


(1) n=l and p =7; 

(2) n=l and « = 0 and 7 = 1 and there is some v < m such that x, — a) = 
Doe = ees 

(3) n <l and u« = 1 and 7 = 0 and there is some v > m such that x, — a = 
207 — 29M 4 4, — Im; ifn <1, then v =n; or 

(4) n<land w=7r =O and vy — 2 = 2°" — 20. 


Applying Lemma 7 to 6+ c we have one of: 


(1)' r=mand v= p; 

(2)’ r= mand v = 0 and pw = 1 and there is some t < s such that 2, —2p, = 
gas _ gat 

(3)' r<mandv=1 and uw =0 and there is some t > s such that 2, —2m = 
20 99S 4 oy, — a, ; if r <m, then t =7T; or 

(4)' r<mandy=p=0 and ay — Lm = 2° — 2%, 


Now from (1)’, (2)’, (3)’ and (4)’ we see that in any event r < m and from 
(1), (2), (3) and (4) we see that n < 1. Thus r < m <n <l. Thus applying 
Lemma 7 to a+ c we have one of: 


(3)* r<landvy=1and7 =O and a, — a = 2°° — 2% + 2, — 2, ; or 
(4)* r<landvy=7T=0 and az — 2 = 2° — 2%. 


We show first that (1) must hold. From (3)* or (4)* we conclude + = 0 
so (2) cannot hold. 

Now suppose that (3) or (4) holds and pick v > m and y € {0,1} such 
that x,—ax, = 2°” —2° +-7-(xy—2m). Since (3)* or (4)* holds pick A € {0, 1} 
such that x, — a = 2°" — 2°° + A- (x, —a,). Then 2° + 2° 4+. ya, + Ars = 
242° + Nx, +yax5. Since s < rand m < v we conclude from Lemma 2 that 
(v,s) = (r,m). But we have already observed that r< msor<m=s <r, 
a contradiction. 

We have thus established that (1) holds. In particular we know pp = 7 
from (1) and 7 = 0 from (3)* or (4)* so up = 7 = 0. We now show that (1)’ 
holds. Since 4 = 0 we know (2)’ cannot hold. 
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Since (1) holds we know that a = 2°* — 2 and b = 2™ — 22” so that 
at+b+e= 2% — gem 1 gar _ 908 1 yy. (x, — 25). Alsoa+b+c € AsU As so 
pick w > uin N and p € {0,1} such that a+b+c¢ = 2°Y —2% 4 p- (tw —2u). 
Then 20% + 997 + 904 4 yg, + pay = 20M +20 4 999 + ox, + vas. Now 
max{k,r,u} = max{w,m,s} and m < k and s < r so w = max{w,m, s}. 
Also m >r > sso we have w >m > s. Since r<m<k and u < w we have 
k = max{k,r,u}. Thus k = w. We suppose (3)’ or (4)/ holds and consider 
two cases. 


Case 1. m =r. Then (4)’ cannot hold so (3)' holds and hence v = 1. We also 
conclude that r > wu. (For if r < u then by Lemma 3 we have (k,u) = 
(w,m) som =u >r=m.) Now since w > m > s the only possibilities 
in Lemma 3 are for conclusion (1) or (5) to hold. If conclusion (1) held 
we would have (k,r,u,0,1, p) = (w,m, s, p,0,1) which is impossible. Thus 
(k,r,0,1,p) = (w,m,0,p,1) so p = 1. Thus we have 2% + 2, + 2%, = 
2° + ap,+ 05 80 Le —- Lm = 2% — 2% + a, — 2,5 and hence u > s. 
Also by (3)! pick t > s such that rp) — tm = 2° — 2°% + a, — as. Since 
vy = 1, (3)* holds so we have x, — a, = 2° — 2% +4, —a,5. Sincel =n 
we then have 2, — tm = 2° +207 —2-2% +4, +2, —2-25. Thus 
20% 2° 4 ye — a, = 2% + 2°T —2.2° 4 4, +4, — 2-25 so that 
204 499 4m ta, = 20% 429° 4 2, + 2,. Thus by Lemma 2 we have 
(u, s) = (t,r) or (u, s) = (7, t). 

But r > s and t > s, a contradiction. 

Case 2. m > r. Then from (3)’ or (4)’ we have that xr, — @m, = 2° — 
2°° +y- (at, — 25). Now w > m > s and (w,m) 4 (k,r) so by Lemma 3 
we must have k > u > r. Since s < r we must then have conclusion 
(5) of Lemma 3 must hold and consequently p # 0, i.e, p = 1. Thus 
2°° + ep + 2m = 2° +vax,+x,~ so that x,—-—Ly = 2°" — 2° + y- (x, — as) 
SO LE — Lm = Ln — Lm and hence k = n. Since n = | < k, this is a 
contradiction. 


Thus we have established that (1)’ holds. Thus 4 = 7 = v so (3)* does 
not hold so (4)* holds. The conjunction of (1), (1)’, and (4)* is precisely the 
conclusion of this lemma. 


Lemma 9. Let a1,a2,a3, and ag in N be given such that FS(({an)4_,) C 


D(B). Then there is some i € {1,2,3,4} such that a; € A; U Ao and {a; : 
gj € {1,2,3,4} and j 4i} C A,UAsS. 


Proof. Suppose first that {a1,a2,a3,a4} C A,U As and assume without loss 
of generality that ay > az > a3 > a4. Applying Lemma 8 to aj, a2, and a3 
we pick k > 1 > m > s in N such that a, = 2°* — 2, ag = 2% — 2°™, and 
a3 = 2°" —2°°, Applying Lemma 8 to aj, a3, and a4 we conclude that m = I, 
a contradiction. 
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Now by Lemma 4 at most one i has a; € A; UA» and by Lemma 5 at most 
one 7 has a; € Ag so to complete the proof it suffices to show that no a; € A3. 
Suppose we have some a; € A3 and assume without loss of generality that 
aye As. 


Case 1. Some 7 has a; € A; U Ag. Without loss of generality az € A, U Ag. 
We may further assume without loss of generality that a3 > a4. Since 
FS(< a, + a2,a3,a4 >) C Aq U As we have by Lemma 8 some k > 1 > 
m > s such that a3 = 2°* —2™ and aq = 2% — 2%, (If a1 + a2 is between 
a3 and a4 we have 1 > m. Otherwise equality holds.) Pick u > v in N such 
that a, = 2%“ — 2° —z,. Since a; +a3 € D(B) we have by Lemma 6 that 
| =u. Since a; + a4 € D(B) we have by Lemma 6 that s = u. But s < l, 
a contradiction. 

Case 2. {a2,a3,a4} C A,UAs. Without loss of generality ag > a3 > a4. Then 
by Lemma 8 we have some k >1 > m > s such that ag = 2°* — 2” a3 = 
gu _ 20m and ag = 2% — 2°°. Applying Lemma 6 to (a1, a2) and (aj, a4) 
we again get | = u = s, a contradiction. 


We temporarily abandon our assumption that a has a fixed value in order 
to state the next theorem. 


Theorem 8. Let ¢ > 0 be given. There is a set B C N with d(D(B)) > 
1/2—e such that no a1, a2, a3, a4, and as have FS((an)?_,) C D(X). 


Proof. Pick a € N such that 1/2°~° < e. Define B as in Definition 1. Observe 
that A; C D(B) and d(A,) > 1/2—1/2%~5 since |A, N{1,2,...,2%¢+9-?}| > 
a (2etroae _ Dee), 

Suppose now one has ay, a2, a3, a4, and as with F'S((a,)>_,) C D(B). 
Applying Lemma 9 first to a1, a2, a3, and a4 one has without loss of generality 
that a, € A,UAp2 and {a2, a3, a4} C A,gUA5. Applying Lemma 9 to ag, a3, aa, 
and as one sees that a; € A; U Ag. Then applying Lemma 4 to a; and as one 
obtains a contradiction. 


We close with two questions which are raised by Theorems 7 and 8. 


Question 1. Jf B C N and d(D(B)) > 0, must there exist a1, a2, a3, and 
a4 inN with FS((an)4_,) C D(B)? 


Since always d(BN(B—t)) > 2-d(B)—1, one easily sees that if d(D(X)) > 
1—1/2™~!, there will exist a), a2,...,@m with FS((a;)™,) € D(X). (See [8, 
Theorem 4.5].) To utilize this to obtain F'S((an)?_1) one needs d(D(X)) > 
1—1/16. 


Question 2. If d(D(X)) = 1/2 or even if d(D(X)) > 1/2 must there exist 
a1, a2, a3, a4, and as in N with FS((an)2_,) C D(X)? 
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Summary. We prove that a maximum subset of {1,2,...,n} containing no 


solutions to x + y = 3z has [§] elements if n ¥ 4, thus settling a conjecture 


of Erdés. For n > 23 the set of all odd integers less than or equal to n is the unique 
maximum such subset. 


1. Introduction 


Many classical problems in computational number theory focus upon subsets 
S of positive integers with the property that for all x, y, z in S, we have 
x+y # kz, for a fixed positive integer k. The history can be dated from 1916, 
when Schur [5], in work related to Fermat’s Last Theorem, proved that the 
set of positive integers cannot be partitioned into finitely many sum-free sets, 
ie., sets having no solution to x+y = z. This result is a Ramsey-type theorem 
which predates Ramsey’s Theorem. In 1927, van der Waerden [9] considered 
subsets having no solution to x+y = 2z, or in other words, subsets containing 
no three-term arithmetic progression. His celebrated theorem states that the 
positive integers cannot be partitioned into finitely many subsets each of 
which contains no k-term arithmetic progressions. In 1952, Roth proved that 
a set of positive upper density contains three-term progressions [4]. This was 
improved by Szemerédi to four-term progressions [7] and later to the general 
k-term progressions [8]. 

A problem which appears in several undergraduate combinatorics texts 
is to show that a maximum subset of {1,...,} containing no solutions to 
r+y = z (a,y,z not necessarily distinct) has size | 4]. In Sect. 2 of this paper 
we find all such subsets and prove the following theorem: 


Theorem 1. A maximum subset of {1,...,n} containing no solution to x+ 
y =z (x,y,z not necessarily distinct) has size [}]|. If n > 3 is odd there are 
precisely two maximum subsets: the odd integers less than or equal to n and 
{x EZ | ntl <a<n}. Ifn > 4 is even there are at least three maximum 
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subsets: {a € Z | ual <a <n} and the two maximum subsets for the odd 
number n— 1. For even n > 10 these three are the only ones. For smaller 
even numbers, {1,4}, {2, 5, 6}, {1, 4,6}, and {2,3,7,8} are the only additional 


ones. 


Let f*(n,2) denote the maximum size subset of {1,...,n} containing 
no three-term arithmetic progressions (such subsets contain no solutions to 
x+y = 2z, but now, for the problem to make sense, z, y, z are distinct). Roth 
first showed [4] that 


r= 0 rates) 


The current best bounds are, for appropriate absolute constant c;, 
con 


—aVlEh © f*(n 9 
ne < f*(n,2) < (les n= 


where the lower bound was proved by Salem and Spencer [6] (see also 
Behrend [1]), and the upper bound was proved by Heath-Brown and 
Szemerédi [3]. 
Erdés conjectured that a maximum subset of {1,...,n} having no 
solutions to x + y = 32 (a,y,z not necessarily distinct) has size no more 
n 


than a small constant more than | $]. In Sect. 3 we verify this conjecture by 
proving the following theorem: 


Theorem 2. Let T,, be a subset of {1,...,n} of maximum size such that 
x+y = 32 has no solutions with x,y,z © Tn (z,y,z not necessarily distinct). 
Ifn #4 then |T,| = | ¥]. 


In Sect. 4 we show that for sufficiently large n there is a unique maximum 
such subset: 


Theorem 3. [fn > 23 and T,, is a subset of maximum size of {1,...,n} 
having no solutions to x+y = 3z then T,, is the set of all odd integers less 
than or equal to n. 


We use the standard notation | ] and | | for least integer not less than 
and greatest integer not greater than, respectively. For a and b nonnegative 
integers we let [a, 6] denote the set of all integers x such that a < x4 < b. 


2. Maximum Sum-Free Sets of {1,...,n} 


Proof of Theorem 1. First we show the maximum size is always | 5]. Let 
U, be a maximum sum-free subset of {1,...,n} and let p be the largest 
integer in U,. Then at most one integer in each of the pairs (i,p — 7), i = 
Ly 2 yates [257] is in Un, so |Un| < [$] < | $]. Clearly, there are subsets which 
attain this bound, so |U;,| = |} ]- 
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To characterize the maximum subsets we consider two cases depending 
on the parity of n. 


Case 1 (n odd). Let n > 5 be the smallest odd integer such that there exists 
a maximum sum-free subset U,, of {1,...,} which is not the odd integers 
less than or equal to n or [2£4,n]. Clearly n € U;, (or else |U;,| < [254] < 
[51), soifn—1¢U,, then, by the minimality of n, either U,, is the set of 
all odd integers less than or equal to n (which is impossible by assumption) 
or U,, = [25+,n— 2] U {n} which is impossible since 45+ + 244 = n. So 
we can assume n — 1 and n are in Up. 

Let G be the graph with vertex set V = {u; | i € [2, 452]U[2*, n—2)} 
of size n — 4 where {v;,v,;} is an edge of G if and only ifi+j =n or 


i+j=n-—1. Then G is the path vp_2, v2, Un_3,U3,-- +) Un=3,Unt1 (with 


an odd number of vertices). Since n—1 and n are in U,, 1 and not are not, 
so the other nos integers in U,, must be the indices of an independent set 
of vertices in G (i.e., no two of them adjacent). The only sufficiently large 
independent set in G is the maximum independent set which has indices 
[pt n— 2], so Up, = (24, n]. 

Case 2 (n even). Ifn > 4 is even and n ¢ U;,, then certainly U,, must be one 
of the two maximum subsets for the odd integer n — 1. It is easy to check 
that the statement in the theorem about when n is 4, 6, or 8 is correct. 
Let n > 10 be the smallest even integer such that there exists a maximum 
sum-free subset U;, of {1,...,n} which contains n but is not [$+ 1,n]. If 
n—1 € Un, we let U,_-2 = U,N[1, n—2], so that |U,—2| = ne? n—2 cannot 
be the odd integers less than or equal to n — 3 because 3 + (n — 3) = n. 
And U,,—2 cannot be [252,n — 3] or [2,n — 2] because 4 ¢ Un. So Un—2 
cannot be any of the three kinds of maximum subsets for even n described 
in the theorem. So by the minimality of n we would have to have n = 10 
and U,-2 = {2,3, 7,8}. This cannot be because 2+ 8 = 10. Hence, as with 
the odd case, n — 1 and n are both in U,,. 

Now let H be the graph with vertex set V = {u; | i € [2,"52] U 
["t? n — 2]} of size n — 4 where {v;,v;} is an edge of H if and only if 
i+j=norit+j=n-—1or {i,j} = {n—2, 25%}. Then H is the cycle 
Un—2; U2, Un—3, U3) +++) Ung, Un=2- Since n — 1 and n are in Uy, and 5 are 


not, so the other we integers in U,;, must be the indices of an independent 


set of vertices in H. There are two possibilities: [2, 257] and ["42,n—2]. If 
n > 10 the first of these cannot occur because it contains 2 and 4 (If n = 6 
the first possibility gives us {2,5,6} while if n = 8 it gives {2,3,7,8}.). So 
we have U,, = [$ +1,n] which completes the proof. 
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3. The Size of a Set with No Solution to x + y = 3z 


We observe ua if Ty, is a set containing no solutions to x + y = 32 and if 
w € Tn, then } xu, 2u, sw, and 2w cannot be in T;, (because x,y, z need not 
be distinct). 


Proof of Theorem 2. The set of all odd integers less than or equal to n has no 
solutions to «+ y = 3z so |T;,| > [ }]. It is easy to check that |T;,,| = [}] for 
n = 1,2,3,5 (there are three ways to choose Ts: {1,3, 4}, {1,3, 5}, or {1, 4, 5}. 
The first of these shows that |7Z4| = 3.) So it remains to show |T,| < [3] 
for n > 6. Let n be the smallest integer greater than or equal to 6 such that 
|T;,| > [5]. We can assume n is even and n € T,, (otherwise n —1 is a smaller 
counter-example). 


Case 1. T,, has no integer x such that | <x“ < 7 By the minimality of n 


at most [| %]/2] of the integers in [1,|#|] are in T,, provided |#| 4 4. So 


fel +| [alee] 


. ifn =0or2 (mod 6) 


[Tr 


IA 


5 +1 ifnm=4 (mod 6). 


So we are done if n # 4 (mod 6) and [3] 4 4. If [3] =4 then n = 12 

or n = 14 and the respective candidates for a set of size greater than > 

are {1,3,4,9,10, 11,12} and {1,3,4, 10,11, 12,13, 14}. However, neither is 
acceptable because 1+ 11 = 3-4. 

It remains only to consider n = 6k + 4 (k = 1,2,3,...), in which case 
the candidate for a counter-example is to choose k + 1 of the integers in 
[1,2k + 1] and all the integers in [44 + 3,6k + 4]. If 2k +1 € T, then 
more than half of the first 2k integers are in T;,, so 2k = 4, n = 16, and 
the candidate is {1,3,4,11, 12,13, 14,15, 16} which fails again because it 
contains 1, 4, and 11. So 2k+1 € T, and 6k +3 is a forbidden sum. Since 
[4k + 3,6k + 2] C T), it follows that T;, 9 [1,2] = 0. This is impossible 
since k + 1 of the first 2k + 1 integers are in T),. 

Case 2. T,, has an integer x such that | <4 < an 


In fact x A > 22 since n € T,. Assume « is the largest integer in T;, such 
that F<a2< ari . Then the integers in W = [3a — n,n] can be arranged in 
pairs as follower 


3 
(32 -—n+j,n—35) j=0,1,2,...n— =| 


Since the sum of the integers in each pair is 3x, at most one integer from 
each pair can be in T),. 
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If x is even then |W| is odd and one of the pairs is (2%, 3). In this case 
Tn contains at most $(|W]|— 1) integers from W and, by the minimality of 


n, at most no |W) +t integers from [1,32 —n— 1]. So |T,| < $. 


If x is odd, then |W| is even and at most $|W| integers from W can be in 
Tn. So at most $(n—|W]) integers from [1,3a—n—1] can be in T;, provided 
32 —n—1 #44. So we are done except for the possibility that 32 —n = 5. 


In this case one of the pairs of integers in W is (24+, 24++) = (244, 248). 
Since x is the largest integer in T,, which is less than ah and since 7 = 
neo < nit < 246 we must have “4° > 2% (so that one integer in this pair 
can be in T;,). Solving this gives n < 18. Since n > 6, the only possibilities 
aren = 10, 7 = 5 andn= 16, x = 7. The first of these is impossible because 
n € T, but TJ, cannot contain both 5 and 10. For the second possibility, since 
7€T, certainly 14 ¢ T,, so the only candidate is {1,3,4,7, 11,12, 13,15, 16}. 


But T;, cannot contain 1, 4, and 11 so the proof is complete. 


4. Maximum Sets with No Solutions to z+ y = 3z 


Choosing lots of smaller integers to go into a set T;, which has no solutions 
to «+ y = 32 clearly eliminates some of the larger integers from inclusion. 
If the smaller included integers follow a simple pattern it may be possible to 
get a simple description of the eliminated larger integers. 


Lemma 1. Let w be an odd integer greater than or equal to 3. If T is a set 
which contains no solutions to x + y = 3z and if T contains all odd positive 
integers less than or equal to w, then T contains no even integer less than 3w. 


Proof. The result is easy to verify for w = 3. If w > 5 and v is any even 
number less than 3w, then (precisely) one of the integers v + 1,v+3,v+5 is 
equal to 3t where t is an odd number less than or equal to w. 


In proving Theorem 3 we will make frequent use of the maximum size 
subsets of {1,...,} with no solutions to «+ y = 3z for n < 22. We have 
calculated them all and display them for even n between 6 and 22 inclusive. 
To get all such maximum subsets for n = 2p — 1 for p = 3,4,...,11, just 
choose the ones for n = 2p which do not include the integer 2p. 


Proof of Theorem 8. Suppose the theorem is false and let n > 23 be the 
smallest counter-example with T;, a subset of {1,...,n} of size [4] which 
contains an even integer. It is easy to see that 23 cannot be added to any 
of the maximum subsets of {1,...,22} listed in the table without producing 
a solution to x+y = 3z, so n > 24. By the minimality of the counter- 
example we can assume n is even and n € T;,. We divide the proof into cases 
(and subcases) along the lines of the proof of Theorem 2. 
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Case 1. T, has no integer x such that } <a < 7m Let y be the largest 


integer in T;, such that y < %. Then 3y is a forbidden sum and at most 
one member of each pair (7, 3y — 7) i = 1,2,...,y can be in T,,. Since T;, 
has no integers strictly between y and 2y 

n 

5 Tal Sy +[[8y, 2] = — 2y +1. 
So, y< L4(n + 2)| and, since T, N [y+ 1, [2] = @, 


5 Slt bull + [5 


The only even solutions greater than 22 for this inequality are n = 
26, 28,34 with corresponding values y = 7,7,9 respectively. If n = 34 
and y = 9 then T;,, must contain five of the first nine integers, which 
(see Table 1) must be {1,3,5,7,9}, and everything in [23,34]. Since 
24+ 3 = 3-9 this cannot happen. If y = 7 and n = 26 or n = 28, 
then T,, contains everything in [18, 26] or [19, 28] respectively. But 7 € Tp, 
so 21 is a forbidden sum, and, since 19 and 20 are in T,, (in both cases), 
neither 1 nor 2 can be in T),. Since four of the first seven positive integers 
must be in T;, this is a contradiction (see Table 1). If y = 4, then |T,,N[1, y]| 
could be as much as 3 in the above inequalities, but } < 3+ [3] has no 
solutions for n > 24. So Case 1 cannot occur. 
Case 2. T,, has an integer y such that 


5 ens (1) 


Let x be the largest integer in T;, satisfying (1). Then 32 is a forbidden 
sum and the integers in W = [3a — n,n] can be arranged in pairs 


((32| — i, [4] +2) i=0,1,...,n—[#] such that the sum of the integers 


in each pair is 3a. If x is even then a is paired with itself. All other pairs 


(for x even or odd) have distinct integers. 
If x is even then, because of the above pairing, 


W|-1 
nw) < WI (2) 
But by Theorem 2, 
32—n-1 —|W|+1 
T,(1,32—n <|* > )=* wht (3) 
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Table 1 Maximum subsets of 1,...,n with no solutions to x + y = 3z 
for 6 <n < 22 (except the set of all odd integers less than or equal to 7) 
n Th 
6 134 156 
145 256 
8 1347 1568 
1567 2568 
2567 1678 
2678 
10 134710 178910 
137910 278910 
147910 378910 
12 13471012 139101112 
14 1391011 12 13 13101112 13 14 
139101112 14 341011 12 13 14 
1341012 13 14 
16 13471012 13 16 1341213 14 15 16 
3471112 13 15 16 131112 13 14 15 16 
13471213 15 16 3411 12 13 14 15 16 
1371112 13 15 16 
18 13412131415 1617 13413 14 15 16 17 18 
20 134131415 1617 18 19 1341415 16 17 18 19 20 
13413 14 15 16 17 18 20 


22 1347 10 12 13 16 19 21 22 135 15 16 17 18 19 20 21 22 
13415 16 17 18 19 20 21 22 145 15 16 17 18 19 20 21 22 


Since = $, equality must hold in (2) and (3). So T, 
contains precisely one integer out of each pair of distinct integers above. 
And T,, 9 [1,32 —n— 1] must be a maximum subset of [1,32 — n — 1] 
containing no solutions to «+ y = 32. 

If x is odd then |W| is even, so T;, must contain precisely one integer 
out of each pair of W and T,,N[1, 3e—n—1] = 242+, unless 3a—n—1 = 4. 
If 3a —n—1= 4 then it would be possible to have T), 1 [1,4] = {1,3, 4} 
and T), contain precisely one integer from all but one of the pairs and no 
integer from that one pair. 

Subcase 2a. [2%4*) < 2n. 

If x is oe then both integers of the pair (32 — 1, 3! +1) are less than 
or equal to 3 qn. Since x is the largest integer in T, qhiah is less than or 
equal to 2 any ieee integer in this pair is in T,, which is a contradiction. 

If x is ion. neither integer in the pair ([32], [32]) can be in T;,, which 
again is a contradiction unless 3x —n — 1 = 4. But since n > 24 


n—|W|+1 , |W-1 
2 + 2 


sr+3 _ nt+8 — 2. 
2 2 = 3” 


so neither integer in the pair (|| — 1, [32] + 1) can be in T;, either, so 
Subcase 2a cannot occur. 
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Subcase 2b. Assume the two following inequalities: 


=e] >2n (4) 


and 
3x —n > 23 (5) 


Since T,,[1, 3e—n—1] is a maximum subset of [1, 32—n-—1] containing 
no solutions to 7+y = 3z and since 23 < 3x—n-—1 < n, by the minimality 
of n as a counter-example, T,, 9 [1,3a —n— 1] must be the set of all odd 
integers less than or equal to 3x—n—1. By Lemma 1, T;, contains no even 
integer less than or equal to 68, so n > 70. 

If x is even, then inequality (4) becomes 


4n—6 
. 6 
re (6) 


And 32 — n— 1 is odd so, by Lemma 1, T,, contains no even integers less 
than or equal to 3(32 — n— 1) —1, which by (6) is greater than n — 10. 
If x is odd then (4) becomes 


4n—3 
7 
x> 9 (7) 


and T,, contains no even integers less than or equal to 3(3a —n — 2) —1 
which by (7) is also greater than n — 10. So T,, contains at most five even 
integers, and hence there are at most five odd integers less than n which 
are not in T,. Let m; = 2/ |] + 2i—1 and p; = 3m; —n, i =1,2,...,11. 
It is easy to check that {m,;} and {p;} are each sets of 11 distinct odd 
integers less than or equal to n and clearly not both m,; and p; can be in 
T,, for any i = 1,...,11. Hence there are at least six odd integers less than 
n which are not in T;,,, which shows Subcase 2b cannot occur. 
Subcase 2c. Inequality (4) holds but (5) does not. 


Hence 


2 324+1 n 
< 12 
an<| 5 )s$+ (8) 


from which it follows that n < 70. So only a finite number of possibilities 
remain to be checked, and this could be done one by one (by hand or 
computer). We prefer to avoid this by considering the following possibilities. 


2c(i). Assume (8) holds and also assume 


[S|+3ses5 (9) 
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If x is even, inequality (8) simplifies to 


(10) 


4n —6 ae n + 22 
3 

And if x is even then, as we showed before, |T,[1, 3z—n—1]| = 34". 
By inequality (9), 31 -n—1>9> 5, so 3a—n-—1 € Ty. Since 
Tr [x +1, |22)] = 0, we must have [3x — | 27] +1,2x%-1] C Th, 
since these integers are each paired with an excluded integer in the 
pairing of [3a — n,n] we discussed before (32 — |4| might not be in 
Tn because it might be equal to 32). But 3(3a —n—1) is a forbidden 
sum so T, [7x — 3n—2, 62 —3n+ || —4] = 0. Let a = 7x—-3n—-2, 
b = 62 — 3n+ |2| — 4, and c = 3c — n—1. With « satisfying (9) 
and (10) it is easy to check that a > 0 and c > 9. Since T,, N[a, b] = 0 
and c € T, we certainly have a contradiction if a < c < b. We also 
have a contradiction if a < b< cand b—a+1>c—b+2 because 
then 


1 
= = [TN [Ld] < [TaN [La-l+ (c-8) 
< ston} 
c—1 
< i 
oD 


Hence we have a contradiction if the conditions a < cand 2b > a+c+1 
are both satisfied, i.e., if 


oe 2n seg rt 
3 4 


But that is precisely our assumption in (9). 
If x is odd then inequality (8) simplifies to 
4n — 3 2 n + 23 
9 3 


(11) 


and the argument is similar. In this case it turns out that [3a — 
| #2], 2a —1] CT, and that 9x — 3n — 3 or 9x — 3n — 6 is a forbidden 
sum, but we still get a contradiction with inequality (9). 
Assume (8) holds and x > 4. 

If x is even there are eight ordered pairs of values for n > 24 and x 
which satisfy (8) when x > }. We list them as triples (n, 7, 32—n—1): 


(24,14,17) (30,16, 17) 
(26,14,15) (32, 18, 21) 
(26,16,21) (34,18, 19) 
(28,16,19) (38, 20,21) 
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Since |T, A [1,32 —n— 1]| = 24" we see (by the table) that if 
3x2—n—1 is equal to 21 or 15 then T;, must contain all the odd integers 
less than or equal to 15 and (by Lemma 1) no even integers at all. 
That eliminates four triples. Since 3x—n—1 € T;,, and z is the largest 
integer in T), less than 7m we cannot have x < 3r—n—1< 22. That 
eliminates three more triples, leaving only (24, 14,17). If3a—n—1= 
17, then 15 € T, (by the table). But « = 14 < 15 < + 24, so we get 
a contradiction here as well. 

If x is odd there are ten such triples (n, x, 3a —n— 1): 


(24,13,14) (30,17, 20) 
(24,15,20) (32,17, 18) 
(26,15,18) (34,19, 22) 
(28,15,16) (36,19, 20) 
(28,17,22) (40, 21,22) 


We will show just the argument for (32,17, 18) here. Since |2| = 21, 
we must have T;,[18, 21] = 0. Since |T;,,N[1, 18]| = 9, by the table (or 
by Theorem 2) 17 must be in T,,. Also, |T,[22, 32]| must be 7. Since 
51 is a forbidden sum, at most one integer in each of the pairs (22, 29), 
(23, 28), (24, 27), (25, 26) is in Ty. By the table 15 € Ty, so 30 € Th, 
which means |T,, 1[22, 32]| < 6 a contradiction. The other nine triples 
can be disposed of with similar (and mostly simpler) arguments. 
2c(iti). Assume (8) holds and x < [$] +3. 


The only possibility here is n = 28 and x = 12. Then 32 —n = 8, so |T,N 
[1, 7]| = 4 and T,, contains precisely one member of each pair (18 — 7, 18 + 7) 
t= 1,2,...,10. Since || = 18, 7,10 [13,18] = G, so [19,23] C T,. But 21 
is a forbidden sum, so neither 1 nor 2 can be in T),. This is a contradiction 
since (by the table) if |T,,  [1, 7]| = 4, either 1 or 2 must be in T),. 


5. Related Problems and Remarks 


For k a positive integer not equal to 2, let f(n,k) be the maximum size of 
a subset S of {1,...,n} such that there are no solutions to 2 + y = kz with 
x,y,z (not necessarily distinct) integers in S (the problem does not make 
sense for k = 2). In this paper we determined f(n,1) and f(n,3) for all 
nm and found all maximum such subsets. The determination of f(n,k) when 
k > 4has a very different flavor, and we have some results in this direction [2]. 

Let g(t) be the maximum “size” (appropriately defined) of a subset S' 
of the closed interval [0,1] having no solutions to x + y = ta where t is a 
fixed positive number. Finding g(t) is a continuous analog of finding f(n, k). 
It turns out there is a strong connection between these problems if k > 4, 
but not for k = 3. For k = 1 we remark that the maximum set [|"4+],n] of 
Theorem 1 does have a continuous analog, while the set of all odd integers 
less than or equal to n does not. 
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In [2] we show that if & > 3 then the positive integers can be partitioned 
into finitely many subsets each of which has no solutions to x + y = kz. 

For k any positive integer, let f*(n,&) be the maximum size of a subset 
of {1,...,} having no solutions to x+y = kz where z, y, z must be distinct. 
Clearly, f(n,k) < f*(n,k). It is easy to show that f*(n,1) = [2+]. There 
are many values of n for which f(n,3) is smaller than f*(n,3). For example, 
the set [1, 4] U[12, 18] shows that f*(n,3) > 11. However, we join Paul Erdés 
in conjecturing that f*(n,3) = f(n,3) = | }] for sufficiently large n. We also 
suspect there is a unique maximum set for sufficiently large n. One could 
do some computer work to obtain some information as to the likelihood of 
this conjecture being correct. To get a proof one could follow the lines of our 
proofs in this paper. Unfortunately, the minor inconvenience of f (4,3) being 
greater than 2 would become the major headache of f*(n,3) being greater 
than [| for many small values of n. 

Of course the problem of narrowing the bounds for f*(n,2) is one of the 
most intriguing problems in combinatorics. 
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Summary. We discuss some simple deterministic algorithms that establish 
primality for a robust set of primes in polynomial time. The first 6 sections comprise 
the intact original article published in the first edition of this volume in 1997. 
The last 2 sections discuss developments in this fast-moving field to early 2013, and 
refer to the prior sections in the past tense. The bibliography for the original article 
and the new update have been combined. 


1. Introduction 


In this paper we present several algorithms that can find proofs of primality 
in deterministic polynomial time for some primes. In particular we show this 
for any prime p for which the complete prime factorization of p — 1 is given. 
We can also show this when a completely factored divisor of p—1 is given that 
exceeds p'/4+®, And we can show this if p—1 has a factor F exceeding p* with 
the property that every prime factor of F is at most (log p)?/*. Finally, we 
present a deterministic polynomial time algorithm that will prove prime more 
than x'~* primes up to x. The key tool we use is the idea of a smooth number, 
that is, a number with only small prime factors. We show an inequality for 
their distribution that perhaps has independent interest. 

It is known that if one assumes the Riemann hypothesis for Dirichlet 
L-functions, then the prime recognition problem is in the complexity class P. 
Thus, from Miller and Bach we know that for every odd composite number n 
there is some integer a in the range 1 < a < min{n, 2(logn)?} such that n is 
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not a strong probable prime to the base a, and so n is proved composite. If an 
odd number n is a strong probable prime to every base a in the above range, 
then n is prime. Thus assuming the above extended Riemann hypothesis, 
every prime p can be deterministically supplied with a proof of its primality 
in O((log p)*) arithmetic steps with integers at most p. 

The results in this paper do not rely on the truth of any unproved 
hypotheses. 

It has been known since Lucas that it is easy to find a proof of primality for 
a prime p if the complete factorization of p — 1 is known. Indeed one merely 
has to present a primitive root for p and prove it is one using the prime 
factorization of p — 1. Though we know no fast deterministic algorithm 
for finding a primitive root for a prime p, the probabilistic method of just 
choosing random integers until a primitive root is found works very well 
in practice. The expected number of tries is O(log log p). In fact, one can 
show that the expected number of random choices to find a set of numbers 
which generate (Z/pZ)* as a group is O(1). (Note also that it is a simple 
matter to deterministically fashion a primitive root out of a set of generators 
with knowledge of the complete prime factorization of p — 1.) Our algorithm 
requires O((log p)!°/") tries, and does not guarantee that it will find a 
primitive root or a set of generators, but it does prove primality and it is 
deterministic. 

It is also known (see Brillhart, Lehmer, Selfridge [8]) that if one has a 
fully factored divisor F of p—1, where F > p!/3, then one can quickly 
decide if p is prime or composite. Again, this involves choosing numbers at 
random. We show how the prime or composite nature of p can be decided 
deterministically and in polynomial time. In addition, we only require F > 
p'/4+© Tn another algorithm we only need F > p®, but for the method to be 
fast, F must be smooth. 

While many of the algorithms in this paper are only of theoretical interest, 
it is likely that at least some of the ideas have practical value. In particular, 
an algorithm we present below which allows one to decide whether n is prime 
or composite, when it is known that all prime factors of n are 1 mod F with 
F > n3/19, should be a practical addition to the Brillhart, Lehmer, Selfridge 
“n — 1 test”. 

It is to be expected that some of the ideas presented here would be of 
use in the “n +1 test” and the combined “n? — 1 test”. These elementary 
tests are often used in conjunction with the Jacobi sums test (see [6] and 
references there), and it is possible that a few ideas presented here will be of 
use in that context as well. However, as stated above, our primary emphasis 
in this paper is theoretical and not practical. 

Let w(x, y) denote the number of integers n < x free of prime factors 
exceeding y. In [15], Erdés and van Lint show that in some sense (a, y) can 
be approximated by the binomial coefficient a) where u = log a/ log y 
and 7(y) denotes the number of primes not exceeding y. In fact an elementary 
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combinatorial argument shows that (x,y) > Ca), so one side of the 


approximation is easy. In this paper we obtain the lower bound 2/(log x)“ = 
giles log ai toey, which is valid whenever x > 4 and 2 < y < «a. This 
lower bound for 7(x, y) is attractive for its simplicity and near universality. 
However, one should note that it is a good approximation to (a, y) only in 
the range (log x)'** < y < exp((log x)*). The inequality in the special case 
y = (log x)? was previously established by Lenstra [19], and for a similar 
purpose. 

Our main idea in this paper is to build up a large subgroup of (Z/nZ)* 
using a small set of generators. Specifically, if p is the least prime factor of 
n and a is an integer with 1 < a < p, let G,,(a) denote the subgroup of 
(Z/nZ)* generated by 7 mod n for j = 2,3,...,a. From the above estimate 
for w(x, y), we have 


#Gn ([(logn)*]) = Y(n, (log n)*) > nb“, 


whenever n > 4 and 2 < (log n)° < p, with p the least prime factor of n. 
Thus we can create an “exponentially large” subgroup of (Z/nZ)* with a 
“polynomially sized” set of generators. The idea of using smooth number 
estimates to show that one has built up a large subgroup of (Z/pZ)* for p 
prime was first used in 1926 by Vinogradov [28] to estimate the least positive 
residue mod p that is not a k-th power. 

It was previously shown by Pintz, Steiger and Szemerédi [22] that there 
are infinitely many primes p that can be proved prime in deterministic 
polynomial time. They require for their primes p that p— 1 has a divisor 
which is a power of 3 and exceeds p'/?. Thus they could only show there are 
more than «?/3-© such primes up to x. As mentioned above, we replace the 
“2/3” with 1. 

Our result in Sect. 3 on deciding if n is prime or composite in deterministic 
polynomial time, when the complete prime factorization of n—1 is given, was 
anticipated by Fellows and Koblitz [16], though their algorithm is not as fast 
as ours. 

We mention a few other results that are somewhat relevant. Adleman 
and Huang [1] have given a probabilistic algorithm for primality proving that 
has expected polynomial time. Much earlier, Solovay and Strassen [27] had 
given a probabilistic algorithm for compositeness proving that has expected 
polynomial time. In [23], the second author showed that for every prime p 
there is a proof that p is prime that can be verified in O(log p) arithmetic 
steps with integers at most p. Previously, Pratt [25] had shown via Lucas’s 
test the existence of a primality proof that requires O((log p)?) arithmetic 
steps. These two papers show only the existence of these proofs; they do not 
show how to find them quickly. 

We wish to thank W. R. Alford, Ronald Burthe, Andrew Granville, 
Hendrik Lenstra, and Jeff Shallit for some helpful remarks. 
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2. A Lower Bound for the Distribution 
of Smooth Numbers 


We say an integer n is y-smooth if no prime factor of n exceeds y. Let 4)(x, y) 
denote the number of integers n in [1,2] that are y-smooth. In this section 
we are going to prove the following theorem. 


Theorem 1. If x > 4 and2<y <2, then y(z,y) > a8 08 #/Io8 y, 


We begin with a few lemmas. Let a(a) denote the number of primes p 
with p< a. 


Lemma 1. For x > 37 we have x(x) — 1(a'/?) > (7/9)x/log x. 


This lemma follows from Rosser and Schoenfeld [26, Theorem 1] and a 
simple calculation. The next lemma is well known; we give the proof for 
completeness. 


Lemma 2. Let p;, denote the kth prime. For x > 1 we have 


Proof. An integer n < «x which is px-smooth has its prime factorization 
in the form p{'p$?...p,z" where a1,a2,...,@, are non-negative integers 
and )*a; log p; < log x. Thus 7(z,px) is the number of lattice points 
(a1,@2,...,@~) € Z* with each aj; > 0 and Soa; log p; < log x. Putting 
each such lattice point at the “lower left” corner of a unit cube with edges 
parallel to the axes, a region is described which is strictly larger than the 
simplex 


k 
{ (1s) ER*: each y; > 0,5 oy; log p; < log oh. 
j=l 


Thus w(x, px) exceeds the k-dimensional volume of this simplex, which is 


k 
(log x) ~ I log p;)~ 


Proof of Theorem 1. We verify the theorem directly for pairs x,y with 2 < 
y < 37 and x < 120. Assume now that 2 < y < 37 and x > 120. Since the 
theorem is trivial when log y < log log 7, we may assume y > 3. It is not 
hard to show that 


2 
girls log x/ log 37 < (log x) (1) 
log 3log 4 
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for x > 120. But the left side of (1) is greater than «!—!°8 1s #/le8 ¥ and the 
right side of (1) is less than 7(x,3) by Lemma 2. Since 7)(x,3) < (a, y), the 
theorem holds in this range. 

Now assume 37 < y < x. Let u = log x/log y and let {u} = u — [u] 
denote the fractional part of u. If m is a positive integer with m < y{“} and 
n is a product of [u] not necessarily distinct primes in the interval (y!/?, y], 
then N = mn is y-smooth and N < ytv}yl“l = y“ = «x. Moreover, since 
m has at most one prime factor in (y'/?,y], it follows that the number of 
representations of N as a product mn in this way is at most [u]+ 1. In fact, if 
{u} < 1/2, then N has at most one representation as mn. We conclude that 


: yt" ](r(y) — a(y/?) MI /({u] + D!, if {u} > 1/2 9 
a iy nly) — my), ty cay. 
Note that using y > 37, we have 
, Gy4} if fu} > 1/2 
! ] - Lyf), if {u} < 1/2. ”) 


Also, from Lemma 1, we have 


Thus 


[u] U 

7u y 
{u} = 1/2))[u] aiid l {u}_ oF 
uf m(y) — aty))0 > (ZY og ay 


[wu] 
= (=) (log x) {4} giles log x/ log y 


Using this inequality with (2) and (3) we have that the theorem will hold if 
we show 


7u\ Ml : Z([uJ+1)!, if{u} > 1/2 
(5) (log 2)" 2 oe if{u} < 1/2. (4) 


We now show 


E\F 
(=) > 2(k +1)! for every integer k > 6. (5) 
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This holds by inspection for k = 6,7,8,9. For any non-negative integer k, the 
arithmetic-geometric mean inequality implies that 


k4+1 
() >(k+1)I. 


Using this and the easily verified inequality 


k k+1 
(=) +2 (“S) for k > 10, 


we have (5). Note that (5) implies (4) when u > 6. 
Suppose that 3 <u < 6 and {u} < 1/2. We verify for k = 3,4,5 that 


7k\* 
ead ok! 
(y) > 


so that (4) holds for these values of wu. 
Suppose now that 2.5 <u <6 and {u} > 1/2. We verify for k = 2,3,4,5 
that 


eee (iog(ar¥)) "> Tea 


so that (4) holds for these values of u. (We use that 7 = y“ > 37”.) 
Now suppose 2.2 < u < 2.5. We have 


(=) ” og a) > (722) (log(37?))9? > 4 = 2ful}, 


so the theorem holds here too. 
In the range 1 < u < 2.2 we use another estimate for (x, y). The number 
of integers up to x divisible by a prime p is [a/p]. Thus 


ven) 2 l— le] >e-1-2 aS (6) 


where p runs over primes. 
First assume that 1.6 < u < 2.2. Then x > 37! > 286. It follows from 
Theorem 5 in Rosser and Schoenfeld [26] that 


Diop lite hie igg yd + : 
) ay) 2 
goes 2(log x) 2(log y) 
1 1 
< log 2.24 < 0.85. 


2(log(37!-))2 " 2(log 37)2 
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Thus from (6) we have 
W(a,y) > 0.15¢—1> 0.142. 


But 
x x 
= 
(log x)" ~ (log(37!°))!: 


gi-los log a/log y _ 


5 < 0.072, 


so the theorem holds in this range. 
Finally assume 1 < u < 1.6. Then from Theorem 5 and its Corollary in 
[26] we have that 


S- ae ee : : 
ae » (log x)? " 2(log y)? 
< log 1.6 : . < 0.59 
O O04 sey 
=e (log 37)? | 2(log 37)2 


so that from (6) we have 
wW(a,y) > 0.412 —1 > 0.382. 
But 


1—log log x/ log y < v < 0.28 
. = log 37 = 


so we have the theorem here as well. This concludes the proof of Theorem 1. 


3. When n — 1 Is Fully Factored 


In this section we present and analyze two deterministic algorithms that will 
decide if a positive integer n is prime or composite when the complete prime 
factorization of n—1 is known. The first algorithm uses the Brillhart, Lehmer, 
Selfridge “n — 1 test” (see [8]). The second algorithm is somewhat faster and 
uses a new result presented below. 

We begin with a factorization algorithm that is very fast, but unfortu- 
nately is usually unsuccessful in factoring composite numbers. 


The base B factorization method. We are input integers n, B withn > B > 2. 
This algorithm attempts to find a nontrivial factorization of n. 


Step 1 Write n in the base B : n = cgB4 + cq_1B*!4+...+ co, where 
Co,---,€q are integers in the interval [0, B — 1] and cq > 0. 

Step 2 Compute c = gcd(co,...,cq). If c > 1, return c as a proper factor of 
n and stop. 
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Step 3 Factor f(x) = caw? +...+ co into irreducible polynomials in Z[2] 
with the algorithm of [18]. 

Step 4 If f(x) is irreducible in Z[z], the algorithm has failed, so stop. 
If f(x) = g1(x)go(x)...9x(x) where each g;(x) is irreducible in Z[z], then 
return gi(B)g2(B)...9x(B) as a nontrivial factorization of n and stop. 


That each g;(B) is a proper factor of n in Step 4 follows from [7]. Thus 
the algorithm is correct. From the analysis in [18], it follows that the running 
time of the algorithm is (log n)O). 

We shall only be applying the base B factorization method in the cases 
d = 2,3 and in these cases it should be considered “overkill” to use the 
algorithm of [18] to factor f(a) in Step 3. In particular, if d = 2, then cox? + 
c1x + Cp factors if and only if ce? — 4coc2 is a square, in which case it is trivial 
to write down the factorization. Further, it is easy to detect squares and take 
square roots of squares with a binary search. Thus the time for Step 3 in the 
case d = 2 is O(log n) arithmetic steps with integers at most n. (Newton’s 
method is even better than a binary search; its complexity is O(log log n) 
arithmetic steps with integers at most n.) 

When d = 3 we can again use a binary search in Step 3. In particular, 
f(a) factors if and only if it has a rational root, and if one rational root is 
found, we can reduce the problem to the quadratic case. It is more convenient 
to replace f(x) with c3f(x) = g(c3x), since g(x) factors if and only if it has 
an integer root. However, every integer root of g divides g(0), so if g(0) £0, 
then every integer root is in the interval [—|g(0)|, |g(0)|] and may be located 
with essentially a binary search. Thus again Step 3 can be accomplished 
in O(log n) arithmetic steps with integers at most n. (Note that Newton’s 
method could be applied here as well.) 

We now describe an algorithm based on the Brillhart, Lehmer, Selfridge 
n — 1 test. 


Algorithm 1. We are input an integer n > 4 and the complete prime 
factorization of n—1. This deterministic algorithm decides if n is prime 
or composite. 

Let F(1) =1. Fora =2,3,...,[(log n)3/?] do the following: 


Step 1 If ais composite, let F(a) = F(a—1) and go to Step 7. Ifa¥(@-) =1 
mod n, let F(a) = F(a—1) and go to Step 7. Verify that a"~! =1 mod n. 
If not, declare n composite and stop. 

Step 2 Using the prime factorization of n—1, find the least positive divisor 
E(a) of n—1 with a®™ =1 mod n. 

Step 3 Verify that (a®\/4 —1,n) =1 for each prime factor q of E(a). If 
not, declare n composite and stop. 

Step 4 Let F(a) = lem{F(a— 1), E(a)}. Compute F(a). 

Step 5 If F(a) >n'/?, declare n prime and stop. 
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Step 6 If n\/3 < F(a) < n‘/?, attempt to factor n by the base F(a) 
factorization method. If n is factored nontrivially, declare n composite and 
stop. If n is not factored, declare n prime and stop. 

Step 7 If a < [(log n)?/?], get the nett a. Otherwise declare n composite 
and stop. 


Proof of correctness. Since (log n)3/? <n for every integer n > 1, Step 1 is 
correct by Fermat’s little theorem. It is clear that Step 3 is correct from the 
definition of E(a). 

Suppose r is a prime factor of n and we have reached Step 4 of the 
algorithm for a particular a. Consider the subgroup G,.(a) of (Z/rZ)* defined 
in the Introduction. We shall show that #G,(a) = F(a). For each prime 
j with j < a we have j?( = 1 modn, so that 7° = 1 modr. Thus 
#G,(a)|F' (a). Further, if 7 is a prime with 7 < a and F(j) > F(j — 1), then 
from Step 3 the order of j in (Z/rZ)* is E(j). Since F(a) is the least common 
multiple of those numbers E(j) with 7 prime, j < a, and F(j) > F(j — 1), 
we have F'(a)|#G,(a). Thus #G,(a) = F(a), as asserted. 

We conclude that if we have reached Step 4 of the algorithm for a 
particular a, then for each prime factor r of n we have r = 1 mod F(a). 
Thus the correctness of Steps 5 and 6 follows from [8]. 

Suppose now that a = [(log n)?/?] and we have reached Step 7. Thus 
F(a) < n'/3. Suppose n is prime. For every a-smooth integer m in the range 
1<m<nwe have m mod n€G,(a). Thus 


F(a) = #Gn(a) = 4(n, a) = (1, (log n)*/?) > n°, 


where the last inequality follows from Theorem 1 and the fact that (log n)?/? > 
2 for n > 4. This is a contradiction and so Step 7 is correct. 
We conclude that Algorithm 1 is correct. 


Analysis of runtime. We measure the runtime by the number of arithmetic 
steps with integers no larger than n. By an arithmetic step we mean addition, 
subtraction, multiplication, division with remainder, greatest common divi- 
sor, and finding an inverse for a member of (Z/nZ)*. Using naive arithmetic, 
an arithmetic step can be accomplished in O((log n)”) bit operations. Using 
the FFT, an arithmetic step can be accomplished in O-((log n)'**) bit 
operations for each € > 0. 

One can use the sieve of Eratosthenes to prepare a list of all of the primes 
up to (log n)?/? in time O((log n)°/? log log n). For each prime number a, 
Step 1 can be accomplished in O(log n) arithmetic steps. Since the number 
of such primes is O((log n)°//log log n), an upper bound for the time spent 
in Step 1 is O((log n)°/?/ log log n). 

To do Step 2 we use a variation of the algorithm of [9]. First consider 
the case where n — 1 is squarefree; say n —1 = qy...qy with q,...,q% 
distinct primes. Then to find E(a) it suffices to find the set of q; which 
divide E(a). But q;|E(a) if and only if a("-)/% #1 mod n. The algorithm 
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of [9] computes all of the residues allizi% mod n = al—)/% mod n for 
i = 1,...,k. It breaks the computation into steps where at a particular 
step we are taking a residue x mod n and computing «” mod n for some j. 
Each q; is used O(log(& + 1)) times, so that the total number of arithmetic 
operations with integers at most n is 


k 
O log q; log(k +1) | = O(log n log(k + 1)). 


j=1 


Now consider the general case where we no longer assume that n — 1 is 


squarefree. Say n — 1 =?! ...q;*" with the q;’s distinct primes and the a;’s 


Bi 


4 


positive integers. We have q’’'||F(a) if and only if 6; is the least non-negative 


By- ay 7 
integer with ga; = 1 mod n. To compute the {;’s we combine the 
ideas from the squarefree case with a binary search. In the first step, we let 


my, = ger? - ger and let a, = a™! mod n. We use the algorithm of 


[9] with a, and the numbers el a to decide if 8; < [a;/2] or [a;/2] < 
Bi < ag for each i = 1,...,k. In the first case we replace gi? in m1 


“:/4) Tn the second case we replace q!°"/7! in my with q**—!*/41, Thus 


with q; Fi 

we have a number mz, we form aj = a2 modn and again we use the 
algorithm of [9], this time with the q;’s raised to exponents about a;/4. 
Continuing in this fashion we compute the (;’s and thus E(a). In the /-th 
step of this algorithm we are using the algorithm of [9] with numbers whose 
product is about n2'. Thus the number of arithmetic operations for the I- 
th step is O(2~' log nlog(k + 1)). Moreover, we can compute a; from aj—1 
in O(2~! log n) arithmetic operations. Thus summing over |, the number of 
steps for Step 2 for a particular value of a is O(log nlog(k+1)). The number 
of values of a for which we perform Step 2 is O(log n). (To see this, note that 
we only perform Step 2 when F(a) > F'(a— 1) and that F(a) is the product 
of the integers F'(j)/F(j — 1) for 2 < j < a.) Since k = O(log n), we have 
that the total time spent in Step 2 is O((log n)? log log n) arithmetic steps 
with integers at most n. 

For Step 3, note that to compute the greatest common divisor it is 
sufficient to work with a?@/4 modn rather than a”(/4, If E(a) = 
of ie ar where qi,.-.-, Qk’, are distinct primes and (1,..., G4, are positive 
integers, let a, = Ila” ‘mod n. We use the algorithm of [9] to compute 
ay livits modn = a ()/% modn for each i. The number of steps is 
O(log E(a) log(k’ + 1)) = O(log n log(k + 1)), where k > k’ is the number 
of distinct prime factors of n — 1. Thus as with Step 2, the total time spent 
in Step 3 is O((log n)? log log n) arithmetic steps with integers at most n. 

Steps 4, 5, and 7 are each O(1) arithmetic steps for each a and, as 
remarked above, Step 6 is O(log n) arithmetic steps for each a. Note that 
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we visit Steps 4, 5, and 6 for O(log n) values of a and we visit Step 7 
for O((log n)3/?) values of a. Thus the total time for all of these steps is 
O((log n)?) arithmetic steps with integers at most n. 

We conclude that in the worst case, Algorithm 1 runs in O((log n)°/?/ 
log log n) arithmetic steps with integers at most n. We have proved the 
following theorem. 


Theorem 2. Given an integer n > 4 and the complete prime factorization 
of n—1, Algorithm 1 correctly decides if n is prime or composite. Further, 
Algorithm 1 uses at most O((log n)°/?/log log n) arithmetic steps with 
integers at most n. 


We remark that in some cases when Algorithm 1 declares n composite, a 
nontrivial factorization of n may also be found. In particular, this is true in 
Steps 3 and 6. However most composite inputs will be proved composite in 
Step 1 with a = 2, in which case no nontrivial factorization of n is produced. 

The next algorithm may be considered an extension of the Brillhart, 
Lehmer, Selfridge n — 1 test. We shall use it as a subroutine in an improved 
version of Algorithm 1 we present below. 


Algorithm 2. This deterministic algorithm finds the complete prime 
factorization of n when input integers n, F such that F > n3/!° > 1 and 
each prime factor of n is 1 mod F. 


Step 1 Ifn < 243, factor n by trial division and stop. 

Step 2 If F > n'/3, use the method of [8] and stop. (That is, if F > n'/?, 
declare n prime; if n'/3 < F < n\/?, use the base F factorization method 
to factor n. Note that if the base F' factorization succeeds in factoring n, 
then it produces the prime factorization of n, while if it fails, then n is 
prime.) 

Step 3 We have n3/1° < F < n\/3. Attempt to factor n by the base 
F factorization method. If this succeeds in splitting n, report it as the 
complete prime factorization of n and stop. Let c,,c2,c3 be the base F’ 
“digits” of n, so that n = c3F°? +c2F?+qF +1. 

Step 4 Let c, = c3F +c2 so thatn = caF? +c, F +1. If either eax? +241 
or (cg — 1)a? + (c1 + F)x +1 are reducible in Z[x], this may lead to a 
factorization of n as in the base F' factorization method. If so, report this 
factorization as the prime factorization and stop. 

Step 5 Develop the continued fraction for c/F and let u/v,u'/v' be 
consecutive convergents with v < F?/./n < vu’. Let up = tu’, vo = £v’ be 
such that uvp + upv = 1. 

Step 6 For each integer d with \d — cav/F| < 2n3/?/F° (< 2) do the 
following. Find all integral roots s of the polynomial 


fa(z) = y? — ay? +ceay+ Fzy +z 
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where y = dup + vx, z = —dug + ux. For any integral root s found with 
(dug t+us)F +1 a nontrivial factor of n, report this number and its cofactor 
inn as the prime factorization of n and stop. If n is not split in this step, 
then declare n prime and stop. 


Proof of correctness. We first show that if n is factored in Step 3, then this 
step produces the complete prime factorization of n. First, it is clear that 
n has at most three prime factors. Thus if f(2) = c3v3 + coz? + c2 +1 
factors into three linear factors in Z[a], then these give, upon substituting 
F for x, the prime factorization of n. Suppose conversely that n has three 
prime factors. Thus there are positive integers a1,a2,a3 with n = (a, F + 
1)(a2F +1)(ag3F' +1). Since n > 243 we have F* > 3n, so that a,aza3 < F/3. 
Thus aja2 + aya3z + aga3 < 3ayaQa3 < F and a,j +a2+a3 < F.. We conclude 
that cz = a1a2a3, Cg = a1a2 + a1a3 + aga3, Cy = a, + ag + a3 and that 
f(x) = (a2 +1)(agx +1)(agx+1). That is, the base F factorization method 
will find the complete prime factorization of n. 

Suppose now that n has exactly two prime factors so that there are 
positive integers a,,a2 with n = (a1F + 1)(aa2F + 1). Assume ay < ag. If 
we obtain any nontrivial splitting of n in any step of the algorithm, evidently 
this gives the complete prime factorization of n. We now show that if n has 
not been factored in Steps 3 and 4 of the algorithm and if n is composite, 
then it will be factored in Step 6. 

Since n = 44 F? +c, F +1 = ajagF?+(a,+a2)F +1, there is some integer 
t > 0 with 


ajag = ca—t, ay tag = + tF. (7) 
From the failure of Step 4 to find a1,a2, we have t > 2. Thus 


4 ata, atek y 


F 8 
a oe a eee (8) 
and 
n n 
ee 9 
a1 Gg? ~ FS (9) 
We have from (7) that 
a, + a9 ajyag+ 1 C4 mr 
t< < : 1 
Sn ee a ead (10) 
Also (7) gives us the equation 
ajc, + aytF = a +¢4—t. (11) 


From the elementary theory of continued fractions we have 


ca wu 1 Jn 
< < : 12 
F aroun (12) 
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Using (11) we have 


ayu + aytu — sal = av (= — =) + (aic1 + atF) = — = 
us cy 7 uv Cav 
=av(—--—)+(ait+a-t 
av (> 7 Ss as ae 
Uy . v 
= Ee) age ty: 
av (> ) Oe 


Thus from (9), (10), (12) and the fact that v < F?/./n we have that 


ee nan 7 FP ae? 


C4U 
aju+a,tv — —|<a a = 


n 
PF OF? 


<i + 

F  F8 F2 FT /n Fs 
(13) 

Let d = aju+a,tv. Note that the general solution to yw+ zv = d is 

given by 
y = dup + us, z = dug — us, 
where s runs over the integers. Let s be the unique integer with 
a, =dvp + vs, a,t = dug — us. 

From (11) we have that s satisfies 


dug — us 


dup + us)? +¢4 — 
(dug ) : dug + us 


= (dup + vs)c1 + (duo — us)F. 
Thus from (13) we see that s is an integral root for one of the polynomials 


fa(z) presented in Step 6. This concludes the proof of correctness of 
Algorithm 2. 


Since the computation of the convergents u/v and u’/v' in Step 5 of the 
algorithm can be made part of the extended Euclidean algorithm for c; and 
F, it is clear that the runtime of Algorithm 2 is dominated by the calculations 
of the possible integer roots of the four cubic polynomials in Step 6. Thus 
Algorithm 2 runs in O(log n) arithmetic operations with integers at most n 
if a binary search is used to find the roots as discussed in connection with 
the base B factorization method above. 

We now use Algorithm 2 in the framework of Algorithm 1. 


Algorithm 3. We are input an integer n > 5 and the complete prime 
factorization of n—1. This deterministic algorithm decides if n is prime 
or composite. 

Let F(1) =1. Fora =2,3,...,[(log n)!°/"] do the following: 


Steps 1-4 These are exactly the same as in Algorithm 1 except that when 
F(a) = F(a—1) we go to Step 6. 
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Step 5 If F(a) > n3/!°, find the complete prime factorization of n with 
Algorithm 2 and stop. 

Step 6 If a < [(log n)!°/"], get the next a. Otherwise declare n composite 
and stop. 


We have already proved in connection with Algorithm 1 that if we do 
Steps 1-4 for 7 = 2,3,...,a, and we have not stopped, then every prime 
factor of n is 1 mod F(a). Thus Algorithm 2 is appropriate to use in Step 5. 
Suppose a = [(log n)!°/"] and we are in Step 6. If n is prime and G,,(a) is as 
before, then, as with the proof of correctness of Algorithm 1, we have 


F(a) = #Gn(a) > W(n, a) = W(n, (log n)!9/7) > n3/10 


by Theorem 1. Thus Step 6 is correct. We conclude that Algorithm 3 is 
correct. 
We have the following theorem. 


Theorem 3. Given an integer n > 5 and the complete prime factorization 
of n—1, Algorithm 8 correctly decides if n is prime or composite. Moreover, 
it uses at most O((log n)'"/7/log log n) arithmetic operations with integers 
at most n. 


4. When n — 1 Is Partially Factored 


In this section we describe a deterministic polynomial time algorithm that 
decides if n is prime or composite when n and a fully-factored divisor F' of 
n—1 are input with F > n1/4t¢, 


Algorithm 4. We are input an integer n and a number € with n > e°, 


0<e<3/4 and (log n)>/(4") <n. We are also input integers F, R with n— 
1=FR and F > n'/4+, and we are input the complete prime factorization 
of F. This deterministic algorithm decides if n is prime or composite. 

Let F(1) =1. Fora =2,3,...,[(log n)°/4*)] do the following: 


Step 1 Ifa is composite, let F(a) = F(a—1) and go to Step 7. If ak? (@—) 
1 mod n, let F(a) = F(a—1) and go to Step 7. Verify that a"~' = 
mod n. If not, declare n composite and stop. 

Step 2 Using the prime factorization of F, compute E(a), the order of 
a® mod n in (Z/nZ)*. Thus E(a) is the least positive divisor of F with 
gO) = nod 7. 

Step 3 For each prime factor q of E(a), verify that (a®?®/4 —1,n) = 1. 
If not, declare n composite and stop. 

Step 4 Let F(a) = lem{F(a— 1), E(a)}. Compute F(a). 

Step 5 If F(a) > naa. get the complete prime factorization of n by 
Algorithm 2. In particular, if n is prime, declare it so and stop; if n is 
composite declare it so and stop. 


Ill 
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Step 6 If F(a) > n\/4+/5, attempt to factor n by the base F(a) factor- 
ization method. If this succeeds in splitting n, then declare n composite 
and stop. Let c1,C9,c3 be the base F(a) “digits” of n so that n = 
c3F' (a)? +coF (a)? +c F (a)+1. Let c, = cg F(a)+cp. If either cax? +c,2 +1 
or (cq — 1)? + (ce, + F(a))x +1 are reducible in Z[x], this may lead to 
nontrivial factorization of n by substituting F(a) for x. If so, declare n 
composite and stop. 

Step 7 Ifa < [(log n)°/(4")] get the nezt a. If a = [(log n)°/4")] and F(a) 
n'/4+</5 declare n composite and stop. If a = [(log n)°/“©)] and F(a) 
ni/4+/5 | declare n prime and stop. 


= 
> 


Proof of correctness. Since (log n)>/(4*) < n, Step 1 is correct. Step 3 is 
clearly correct. We recall the definition of G,.(a) from the Introduction. If we 
have passed Step 3 of the algorithm for 2,3,...,a, then F(a)|#G,(a) and 
#G,(a)|RF (a) for each prime factor r of n. In particular r = 1 mod F(a). 
Thus it is appropriate to use Algorithm 2 in Step 5. 

It is clear that Step 6 is correct since it only declares n composite when 
it succeeds in splitting n. Suppose we are in Step 7 and a = [(log n)°/(4®)]. 
If n is prime, then as in the analysis of Algorithm 1, we have 


RF(a) > #Gn(a) > W(n, a) = V(n, (log n)>/42)) > nb-4e/5 


by Theorem 1. Since R < n°/4-*, we thus have F(a) > n'/4+#/5, We conclude 
that if F(a) < n'/4+*/5, then Step 7 is correct in declaring n composite. 
Suppose finally we are in Step 7, a = [(log n)°/4")], F(a) > n'/4t2/5 and 
n is composite. Since Step 6 was not able to split n, we have as with the 
analysis of Algorithm 2 that n is the product of two primes. (It is here where 
we use the hypothesis n > e? since this assures that F(a)* > n!+4*/5 > 
n((log n)>/(4#))4¢/5 = n log n > 3n.) Say n = ryr2 where r, < rz are primes 
and r; = bi) F(a) +1, rg = boF(a)+1, where by, bg are positive integers. Again 
from the failure of Step 6 to factor n and the argument for Algorithm 2 
(see (11) and (9)) we have 


2 n z nr 
+” bo F(a)? ~ F(a)?’ 


b> AF > Ra), 


Thus 
#G,,(a) < (RF(a),r2 —1) < (n—1,r2—1) 
= (bj boF (a)? + (b1 + b2) F(a), b2F(a)) 
= (b\bF (a) + by + bo, b2) F(a) = (b1, b2) F(a) 


rg <n */ bry, 
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On the other hand, as before, we have 


#Gr.(a) > U(r2,a) = U(r2, (log n)°/4)) > P(r2, (log r2)°/49)) > rp ¥° 


by Theorem 1. These last two displays are incompatible. Thus Step 7 is 
correct in declaring n prime when a = [(log n)>/(4*)] and F(a) > ni/4te/5, 
This concludes the proof of correctness for Algorithm 4. 


The runtime analysis for Algorithm 4 is argued analogously to that of 
Algorithm 1. We have the following theorem. 


Theorem 4. On input of an integern > e?, a number e in the range 0 <e < 
3/4 with (log n)°/(4) <n, integers F, R withn —1=FR and F > n\/44¢, 
and the complete prime factorization of F', Algorithm 4 correctly decides 
if n is prime or composite. The runtime is O((log n)!+5/(4*)/log log n) 
arithmetic operations with integers at most n. 


5. Primes Recognizable in Deterministic Polynomial 
Time 


The algorithms in the preceding two sections to determine whether n is prime 
or not all hypothesized substantial information about the factorization of n—1 
(or knowledge about the prime factors of n). In this section we present an 
algorithm that can be applied to any number n. For most numbers, this 
algorithm will not be very efficient—in fact it will take exponential time. 
However there are also many primes for which the algorithm will work in 
polynomial time—more than «!~* of them up to zx. Before we give this result, 
we first state the algorithm. 


Algorithm 5. Suppose we are input a positive integer n > 5 x 10'4. This 
deterministic algorithm decides if n is prime or composite. 


Step 1 Continue using trial division on n—1 until a fully factored divisor 
F of n—1 is found with F > n'/3, 
Step 2 Use Algorithm 4 with inputs n, ¢ = 1/12, F, R= (n—1)/F. 


It is clear that Algorithm 5 is correct. It is also clear that for some numbers 
it is a terrible algorithm. For example, if n is even, one might well spend 
exponential time discovering that n is composite. Nevertheless, Algorithm 5 
is able to prove prime quite a few numbers in polynomial time. 


Theorem 5. For eache > 0 there are numbers k and xp such that if x > xo, 
then the number of primes p <x which Algorithm 5 proves prime in at most 


E 


(log p)* arithmetic steps with integers at most p exceeds x1~*. 
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The proof of this theorem depends strongly on the distribution of primes 
p for which p—1 has a large smooth divisor. We establish such a result now. 


Theorem 6. There are effectively computable positive constants c,,x1 with 
the following property. Suppose x > x1, log x < y < a/?° and N(a,y) is 
the number of primes p < x such that p—1 has a y-smooth divisor exceeding 
a/3_ Then N(a,y) > a/(c, log z)!*+“/3, where u = log a/log y. 


Proof. Let x2 be the number x-,5 in Theorem 1 of [4], where « = 1/11 and 

6 = 1/60. If x > xg, let d,,d2,...,dy be the possible “exceptional moduli” 

corresponding to x in this theorem, so that they all exceed log x and k = 

k(x) = O(1). Let q; denote the greatest prime factor of d; fori =1,...,k. 
Let 


P={q prime :y/2<q<y}\{u,---,a6}- 


Thus if an integer d is composed solely of primes from P, then no d; divides d. 
From Mertens’s Theorem we have that if x is sufficiently large, then 


re 25° ae 


qeEP qeEP 


14 
oe y m2) 


Let v = [(log(x!/3))/log(y/2)] and let D(P, v) denote the set of integers 
d composed of v not necessarily distinct primes from P. If d € D(P, v), then 
clearly d > x!/3 and d is y-smooth. Further, if x is sufficiently large, then 


d < ytt(log(x'/*))/log(y/2) — y(q1/3ylog v/los(y/2) < 42/5, 


Thus if a is sufficiently large and d € D(P,v), we have from Theorem 1 
n [4] that 


9 x 
m(x,d,1)= 7 12 -——, (15) 
p<a,d|p—1 10y(d)_ log # 


where p runs over primes and y denotes Euler’s function. 

For n a positive integer, let (n, P) denote the largest divisor of n composed 
of only primes from P. For p a prime, let d(p, v) denote the number of divisors 
of p— 1 which come from D(P,v). Note that d(p,v) = 1 if and only if there 
is some d € D(P, v) with dip — 1 and ((p— 1)/d,P) = 1. Thus 
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N(a2,y) > S- 1> S- 1= S- a 1 


pSa psx dED(P,v) p<a,d|p—1 
dpeet. aipa)=1 ((p=1)/4,P)=1 
= oy se De, oe 
deD(P,V) psx d€D(P,v) p<a,d|p—1 
d|p—1 ((p—1)/d,P)>1 
(16) 
a Dy Oy I De. De 
deD(P,V) psa d€D(P,v)qEP p<x 
d|p—1 dq\p—1 
= dD) mediy— 7 Dd) a(eda1) 
deD(P,v) deD(P,v) qEP 


From (15) we have 
> Ce Se > —_ (17) 


if x is sufficiently large. From the Brun-Titchmarsh inequality we have 


d Lemead< D0 Lo cape 


deD(P,v) qeP d€D(P,v) geP * 


x 1 
Ot ear Taare: an i 
log x log y oe r y(d) 


where we use (14) for the last inequality. Putting this estimate and (17) into 
(16) we have for all sufficiently large x that 


(18) 


We now estimate this last sum. We have from (14) that 
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Vv 


= exp(—v log uv — v log log y + O(v)) 


1 
= exp(— =u log u — gt log log y + O(u)) 


1 
3 


1 
= exp(— 3U log log « + O(u)). 


Putting this in (18) we have the theorem. 


Proof of Theorem 5. First note that from Theorem 6, if k > 1 is arbitrary, 
then the number of primes p < a for which p—1 has a (log x)*-smooth 
divisor F with F > «1/9 is at least 2!—!/@*)+°( as & — oo. For such primes 
p, Step 1 of Algorithm 5 takes at most O((log x)*) arithmetic steps with 
integers at most p. The number of arithmetic steps with integers at most p 
to complete Step 2 of Algorithm 5 is O((log p)'®/ log log p). 

It suffices to establish the theorem for values of € satisfying 0 < € < 1/48. 
In this case let K > K’ > 1/(3e) be arbitrary. If p > 2'/3 and c is any 
constant, then (log p)* > c(log x)*’ for all large x. Thus if x is large, p is a 
prime with p < x and p—1 has a (log x)* -smooth divisor F > x!/3, then 
Algorithm 5 takes at most (log p)* steps with integers at most p to prove p 
prime. By the above, the number of such primes is at least 2!~!/(@4")+0() 
for x > oo. As 1— 1/(3K"’) > 1 —«, the theorem follows. 


6. More Primes Recognizable in Deterministic 
Polynomial Time 


In this section we describe a deterministic algorithm that recognizes many 
more primes in polynomial time than our previous methods. Covered is any 
prime n with a divisor F of n—1 exceeding n* and such that all of the prime 
factors of F are at most (log n)*. The running time is about O((log n)?/* + 
(log n)*). We also show that for most such primes, the 2/¢ can be reduced 
to 1/e. A corollary of this algorithm is an improved version of Theorem 5. 
There if we were willing to spend time (log n)* on trying to prove n prime, 
we would succeed for about z!~(3*)"" primes up to z. With the methods of 
this section we will succeed for about x1~* * primes up to x. 

If n is prime and the order of b in (Z/nZ)* is E, then for any a with a? = 1 
mod n, there is some exponent j € {1,2,...,E} witha=0’ mod n. Thus if 
it is shown that no such j exists, then it is proved that n is composite. How 
difficult is it to do this test? If we have already prepared the complete set {b/ 
mod n: 7 = 1,2,...,E}, then testing if there is some j with a=b’ modn 
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can be accomplished by a binary search in O(log E) steps. Thus we have the 
initial step of preparing the set of powers of b, which takes F steps, and then 
each subsequent test takes O(log FE) steps. 

In the case when E = q® with q prime and 6 > 1, we can do a 
precomputation taking q steps, with each subsequent test taking O(8? log q) 
steps. Here is how. Suppose the order of b mod n in (Z/nZ)* is q’. Assuming 
that we have already computed ov" mod n (if not, this takes an additional 
O(6 log q) steps for the precomputation), we can compute the set 


B={e” mod n:j =1,...,q} 


in g steps. Now suppose we are presented with some integer a with the order 
of a mod n in (Z/nZ)* equal to g*, with 0 < a < 6, and we wish to see if 
there is some integer j with a= b/ mod n. If a = 0 or 1, then if j exists it 
is a multiple of g?—!, and so we test for membership of a mod n in B by a 
binary search. As an induction hypothesis suppose 1 < a < 6 and we have 
already described how to find 7 for any a’ for which the order of a’ mod n 
in (Z/nZ)* properly divides g*. Note that the order of a4 mod n is q*~!, so 
we may use our inductively described algorithm to search for some integer 
jo with a? = 63° mod n. Suppose we have found jy. Then it must be that 
g2--) divides jo and, in particular, q|jo. Then ab-°? ' mod n has order 
dividing q. But we have already described how in this case we may search for 
an integer j, with ab-J07 ' = b/. mod n. If these searches are successful, we 
may take 7 = j, + joq7!. Totaling up the time spent, we have used a binary 
searches in the set 6, we have done a—1 modular multiplications, and we have 
done a — 1 modular powerings with exponents at most q? (in fact, at most 
q°-1). The latter computation dominates, taking O(a log q) = O(6? log q) 
arithmetic steps with integers the size of n. 

The computation of B, which is the precomputation step of this method, 
we call “Set up (b,¢°)”. The subsequent search for an exponent j we call 
“Test. (b,q°; a)”. With these subroutines we are now ready to describe the 
main algorithm of this section. 


Algorithm 6. We are given positive integers n, F, R and a positive number 
e such that n > 4, 2 < (log n)?/* <n, n—1 = FR, and F > né. This 
algorithm decides if n is prime or composite. 

Let F(1) =1. Fora =2,3,...,[(log n)?/*] do the following. 


Step 1. Check if n is even or if n is a nontrivial power. If so, declare n 
composite and stop. 

Step 2. Verify thata"-'=1 mod n holds. If not, declare n composite and 
stop. 

Step 3. Compute E(a), the order of a® mod n in (Z/nZ)*. Let F(a) = 
Iem{E(a), F(a — 1)}. For each prime q|F(a)/F(a — 1), verify that 
(aPRE(a)/q _ 1,n) =1 holds, declaring n composite and stopping if not. 
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Step 4. For each prime q and positive integers a, 8 with q* || (E(a), 
F(a—1)) and q? || F(a—1) do Test (bg, q®;aP#()q“). Tf this test proves 
n composite, then declare this and stop. 

Step 5. For each prime q and positive integer B with q|F(a)/F(a—1) and 
q? || F(a), let by = a2" mod n and do Set up (bq, q°). 

Step 6. If F(a)&®* > nlesler, declare n prime and stop. If a < 
[(log n)?/*], get the neat a. If a = [(log n)?/*], declare n composite 
and stop. 


Proof of correctness. Suppose we have made it to Step 6 and a = [(log n)?/*]. 
Suppose n is prime. From Theorem 1 we have #G,(a) > n!~*/?. Every 
b € Gn(a) satisfies bo" YF/F = 1 mod n, so #Gn(a) < (n-1)F(a)/F < 
n'—* F(a). Thus F(a) > n*/?, so that F(a) ¢ > nies 8", Hence it is correct 
to declare n composite when this inequality fails. 

Suppose n is composite and suppose we have made it to Step 6 for a 
particular a. From Step 1 we know that n is divisible by at least two distinct 
odd primes. From Step 3 we know that each prime factor of nis 1 mod F'(a). 
Let F ={b mod n:b"-!=1 mod n}. By the Chinese remainder theorem, 
this subgroup of (Z/nZ)* is isomorphic to the direct product of the groups 
F, ={b mod p®:b”~!=1 mod p*} where p® runs over the prime powers 
with p® || n. Since p is an odd prime, (Z/p%Z)* is cyclic so that #F, = 
(n — 1, e(p*)) = (n—1,p—1). Thus for each prime power q? with q° || F(a) 
we have q? |#Fp. Since n has at least two distinct prime factors p, the number 
ofb mod n € F with b? =1 mod nis at least q2°. From Step 2, Gn(a) is a 
subgroup of F. And from Step 4 we have that G,,(a) has exactly g? members 
b mod n with 6” =1 mod n. Thus the index of G,,(a) in F is at least the 
product of the prime powers q? with q® || F(a), which is F(a). We have from 
Theorem 1 that 


n 


(a)’ 


1 
1—log log n/ log a 
ni—log log < #Gn(4) 5 FG [[@ 1,p <= 


p\n 


so that F(a) < nos les "/le8 ¢, Hence it is correct to declare n prime when 
F(a)'°8 @ > nies 08 "| This concludes the proof of correctness. 


Analysis of runtime. For each integer k < log n/log 2 we can check if n is a 
k-th power by computing [n!/"] with a binary search and seeing if [n!/*]* = n. 
When k > 3, the binary search may begin with [n!/(*-))]. Thus the number 
of arithmetic operations to see if n is a k-th power is o( e+ log n), so the 
total number of arithmetic operations for Step 1 is O(log n(log log n)?). 

The time for Step 2 is at most O((log n)!+?/*), 

For each a, the time for Step 3 is O(log nlog log n). Thus the total time 
for Step 3 is O((log n)'*?/* log log n). 
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As we have seen, each implementation of Test (b,,q°;a) in Step 4 takes 
time O(8? log q). Thus the total time for Step 4 is O((log n)?+?/*). 

Each time we do Set up (bg, ¢°) in Step 5, it takes time O(q + 6 log q). 
Thus if Q is the total number of prime factors of F', counted with multiplicity, 
and if each prime factor q of F satisfies g < B: then the total time for Step 5 
is O(BQ + 0? log n) = O(B log n + (log n)). 

Thus the total number of arithmetic steps with integers at most n is 
O((log n)?+?/© + B log n). 

We have the following theorem. 


Theorem 7. On input of positive integers n, F, R and a positive number ¢ 
withn > 4,2 < (log n)?/* <n,n—-1=FR and F > n*, Algorithm 6 correctly 
decides if n is prime or composite. The running time is O((log n)?+?/* + 
B log n) arithmetic steps with integers at most n, where B is the largest 
prime factor of F. 


We remark that the term (log n)?+?/* in the running time may be 
replaced with e(log n)?+?/*/log log n if we perform Steps 2 to 4 only for 
prime values of a. 

The time bound in Theorem 7 is only an upper bound. With the aid of the 
next result we will be able to show that most primes for which Algorithm 6 
is applicable are proved prime in a considerably shorter time. 


Theorem 8. For 2 < y < = let R(x,y) denote the number of primes p 
in the range y < p < x such that (Z/pZ)* is not generated by the set {a 
mod p:1<a< y}; that is, such that G,([y]) 4 (Z/pZ)*. Then R(x,y) < 
8x? log log(x?) / logy 


Proof. Let Z denote the set of y-smooth numbers up to x?. Suppose p is a 
prime counted by R(x, y). Then #G,([y]) divides (p — 1)/q for some prime gq. 
Thus the set Z occupies at most (p—1)/q residue classes mod p, so there are 
at least p/2 residue classes mod p free of elements of Z. Thus by the large 
sieve (see Theorem 3, p. 159 in [12]) we have 


Thus from Theorem 1 we have 


8a? 8x2 
R ae 
9) S oz = Fey) 


which completes the proof of the theorem. 


< 8z2)8 log(2”)/log y_ (19) 


Because of the use of Theorem 1 in the proof, Theorem 8 is only fairly 
good when y is a power of log «. If we let v = 2 log a/ log y and use a stronger 
estimate for w(x”, y) (see [10]) we may obtain the following result which is 
valid in the same range as is Theorem 8: 
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R(a,y) < exp(v(log v + log log v — 1 + (log log v — 1)/ log v) 
+ O(u(log log v/ log v)?)). 


This estimate is implicit in the dissertation of Pappalardi [21, Sect. 3.3]. 
The estimate has an inexplicit constant, though an actual numerical value 
could be provided in principle. We remark that Vinogradov, Linnik and 
Fridlender have discussed problems related to the estimation of R(x, y). 


Theorem 9. Let x,¢ > 0 be arbitrary and let E(x,¢) denote the number of 
primes p with (log p)2/é < p< «x for which there is some integer F > p* 
with F|p — 1 and such that if Algorithm 6 is run on n = p, F, €, then 
F(a)°8 @ < pls 8 P for a = [(log p)'/*]. Then E(x,€) < 9x + €3?. 


Proof. Suppose Algorithm 6 is run on n = p, F', ¢ where F|p—1 and F > p*. 
If F(a) = F with a = [(log p)‘/*], then F(a)'°% ¢ > p'os 8 ?, Thus we may 
assume that if p is counted by E(a,¢), then F(a) < F with a = [(log p)!/*]. 
Thus (Z/pZ)* is not generated by {7 mod p: 1 < 7 < a}. We conclude from 
Theorem 8 that 


E(e,2) — B(e?, 2) < R(a, [og 2)"/*1) 
< R(a, (log x!/?)¥/*) 


2 1/2 
< Rr2e log log(a*)/ log log(x ). 


Note that if > e?, then log log(x?)/log log(a!/?) < 3/2. Thus if k is 
that positive integer with 22" < 32 <2?“~” then 


E(x,e) =) (E(a? ",e) - E(u?’ ,e)) + E(x? “,e) 
1=0 


< ¥°8(x? *)% + E(e?2,€) 


< Ox ae en, 


which concludes the proof of the theorem. 


We remark that Theorem 8 can also be used in the context of Algorithm 4 
to give a small improvement in that algorithm for most primes. 
We now give an algorithm similar to Algorithm 5. 


Algorithm 7. Suppose we are input a positive number € and an integer 
n > 4 with 2 < (log n)?/* <n. This algorithm attempts to decide if n is 
prime or composite. 
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Step 1. Using trial division, find the largest divisor F of n — 1 composed 
of primes up to (log arr, If F < n*, then stop for the algorithm has 
failed. 

Step 2. Use Algorithm 6 on n, F, €, terminating with failure if the 
parameter a exceeds [(log n)'/*]. 


From the proof of Theorem 6, for each number ¢ with 0 < « < 1, thereisa 
number x2(¢), such that if x > x2(e), the number N of primes p < x for which 
p—1 has a (log p)!+!/£-smooth divisor exceeding p® satisfies N > 2x!-©*. For 
such primes p we make it past Step 1 of Algorithm 7. From Theorem 9 we 
have that if 0 <e¢< 1/4, then at least half of these primes are proved prime 
in Step 2 of Algorithm 7, though x2(¢) may have to be adjusted. We thus 
have the following result. 


Theorem 10. Let ¢ be any number with 0 < ¢ < 1/4. There is a number 
xo(e) such that if x > ae(e) then the number of primes p < x which 


Algorithm 7 proves prime exceeds gi 


We remark that the running time of Algorithm 7 is O((log n)?+!/*) 
arithmetic steps with integers at most n. Thus Theorem 10 improves on 
Theorem 5 since there if one wants to prove prime gine primes up to 2, the 
bound for the running time is about (log n)!/@«"), 


7. Update on Primality Testing: Background 


The previous sections comprised the original article in the first edition of this 
volume published in 1997. This section and the next one summarize the state 
of primality testing in 2013, 16 years later. 

The subject of primality testing concerns the creation and analysis of 
efficient algorithms for deciding whether a given integer n > 1 is prime or 
composite. This subject is closely related to, but distinct from, factoring. 
Some algorithms, such as trial division, can accomplish both tasks, but the 
most efficient methods are tailored to one or the other. 

From a practical point of view, the story of primality testing is a 
simple one. In real-world applications one does not require mathematical 
certitude, a tiny possibility of error being acceptable, so various random 
algorithms that have been known for decades and are easy to implement may 
be used. An example, commonly known as the Miller-Rabin test, runs in 
O((log n)?**) bit operations (using fast arithmetic subroutines) and almost 
certainly returns a correct verdict on the primality of a given input n, 
with the bonus that a composite verdict is mathematically correct. Even 
the simple base-2 Fermat congruence 2"~! = 1 (mod n) when applied to 
a large random input n almost certainly steers one right (a number n for 
which the congruence holds is almost certainly prime, a number n for which 
it does not hold is definitely composite). Indeed, as shown by Erdés [13], 
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composite numbers n satisfying 2"~' = 1 (mod n) are much scarcer than 
primes. For more details on these and similar tests see [11] and the references 
there. 

But a problem as fundamental as deciding primality cries out for a 
thorough mathematical analysis. Here too, where no possibility of error is 
to be tolerated, there is both a theoretical and practical side. The practical 
primality tester has specific numbers n in mind that are to be tested, and 
wishes to implement an algorithm that will give a completely correct answer. 
It is possible for such an algorithm to use randomness, where coins are flipped 
(figuratively), but there is no doubt in the output, the only issue being the 
running time of the algorithm. A simple but illustrative example is that of 
finding a quadratic nonresidue for a given prime p. This is an integer k such 
that the congruence x? = k (mod p) has no integral solutions. We know 
that for an odd prime p exactly (p — 1)/2 choices for k& in {1,...,p — 1} 
are quadratic nonresidues. Moreover, via either Euler’s criterion or the law 
of quadratic reciprocity for Jacobi symbols, it is possible to decide quickly 
(and deterministically) if a candidate k works or not. So a random and 
quick method to find a quadratic nonresidue k is to choose randomly from 
{1,...,p — 1} until one is found. This simple algorithm runs in expected 
polynomial time. Remarkably, without assuming an unproved hypothesis 
(the Extended Riemann Hypothesis), we know no deterministic method for 
finding a quadratic nonresidue that runs in polynomial time. 

Long before our article was published, we had the Adleman—Huang test 
[1], a random primality test with running time expected to be polynomial 
(and, as opposed to the Miller—Rabin test, there is no doubt in the output). 
Based on the arithmetic of Jacobian varieties of hyperelliptic curves of genus 
2 (and also on elliptic curves), it is a very difficult result, requiring an 
entire volume for its analysis. Other tests, based on elliptic curves (practical 
improvements of the Goldwasser—Kilian test) are not theoretically complete, 
but stand as excellent practical primality proving algorithms for those who 
do not wish to have any possibility of error. Again, see [11] for more on this. 

And this brings us to the last holdout of the theorist: a deterministic 
primality test that runs in polynomial time. Such a test has long been 
known (in fact, a version of the Miller-Rabin test), but it relies on the 
Extended Riemann Hypothesis in a similar way as the quadratic nonresidue 
problem mentioned above. Withot any unproved hypothesis, we had the APR 
test [2] with complexity O((log n)*° 18 18 !°8 "), tantalizingly close to being 
polynomial. We also had an interesting result of Pintz, Steiger, and Szemerédi 
that presented a set of primes, with counting function to x of about 22/3, 
which could be recognized in deterministic polynomial time. These primes p 
were characterized by p — 1 being divisible by a very large power of 3. 

In our paper we showed that any prime p can be proved prime by a 
deterministic algorithm in polynomial time, provided we have a fully-factored 
divisor F > p* of p—1. Furthermore, a simple procedure identifies more than 
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x'-© primes p up to x with such a fully-factored divisor in p— 1, and so we 


have many primes that are recognizable in polynomial time. The case when 
F > p'/?+© was done earlier by Fiirer [17] (we only learned of this paper 
recently), and the case when p — 1 itself is fully factored was rediscovered by 
Fellows and Koblitz [16]. 

Our paper had one practical component for those interested in imple- 
menting a primality test. The so-called “n — 1 test” of Brillhart, Lehmer, 
and Selfridge requires a fully-factored divisor of p — 1 larger than p'/?. Our 
paper was able to reduce the exponent 1/3 in this practical test to 3/10. 
(For positive exponents smaller than 3/10 our algorithm still has polynomial 
complexity, but it is not so practical.) More recently, an analog of this 
improvement was accomplished for the “n +1 test”, see [11], though it is 
no longer deterministic. 


8. Update on Primality Testing: Derandomization 
and the AKS Algorithm 


By far the most important development since our article was the AKS 
algoritm [3], named for its inventors, Agrawal, Kayal, and Saxena. Their 
algorithm is deterministic and it distinguishes between primes and composites 
in polynomial time. Further it does not depend on any unproved hypotheses 
for its analysis. 

Like the algorithms in our paper and in many other approaches to 
primality testing, the AKS algorithm either recognizes n as composite by 
a series of simple tests, or if nm passes all of these tests, a group is built up 
that is so large that n is inescapably prime. (For more on this line of thought 
see [24].) 

Two analyses of the AKS algorithm are presented in [3], a more 
elementary analysis using effective tools and running time O((log n)!°->**), 
and an analysis using ineffective tools and running time O((log n)7°+*). Both 
of these estimates are upper bounds for the true running time, conjectured 
to be O((log n)®**). A version of the AKS algorithm with this running time 
and effective tools is presented in the preprint [20] and is described in [11]. 

Unfortunately, the AKS algorithm has not proved to be numerically 
competitive with previous primality tests. Even a random version with 
expected running time O((log n)**) (see [5]) is not competitive. 

In [3] a conjecture is made that suggests a version of the AKS algorithm 
has running time O((log n)?**). Using a heuristic of Erdés [14] on Carmichael 
numbers, Lenstra and Pomerance (unpublished) have given a plausibility 
argument that this AKS conecture is false. 

Since we knew already a random polynomial-time algorithm for primality 
testing, the AKS test might be thought of as a derandomization, even though 
it bears little resemblance to the Adleman—Huang test. Similarly, the Fiirer 
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and Fellows—Koblitz algorithms for proving the primality of a prime p where 
a large part of p — 1 is factored are derandomizations of an algorithm of 
Lucas, as improved by Proth, Pocklington, and Lehmer early in the twentieth 
century. Our paper as well contains a derandomization (and extension) of the 
Brillhart, Lehmer, Selfridge n — 1-test, as mentioned. In [29], Zralek applied 
some of the methods of our paper to derandomize a factorization algorithm, 
namely the p—1 method of Pollard. Here, one is expected to find quickly those 
prime factors p of n which have the additional property that all of the prime 
factors of p— 1 are small. (For this reason, some implementers of the RSA 
cryptosystem use prime factors p,q where both p— 1,q— 1 have large prime 
factors, so-called safe primes.) The Pollard p — 1 method uses randomness 
and Zralek derandomizes it. In a later paper he again uses similar ideas, this 
time to factor polynomials over some finite fields F,. 
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1. Introduction 


Let A be a subset of an abelian group. Let hA denote the set of all sums of 
h elements of A with repetitions allowed, and let h’ A denote the set of all 
sums of h distinct elements of A, that is, all sums of the form a, +---+ an, 
where aj,...,a, € A and a; # a; fori Fj. 

Let A be a set of & congruence classes modulo a prime p. The Cauchy- 
Davenport theorem states that 


|2A| > min(p, 2k — 1), 
and, by induction, 
|hA| > min(p, hk —h+1) 
for every h > 2. Erdés and Heilbronn conjectured 30 years ago that 
|2* A| > min(p, 2k — 3). 


They did not include this conjecture in their paper on addition of residue 
classes [10], but Erdés has frequently mentioned this problem in lectures 
and papers (for example, Erdés-Graham [9, p.95]). Dias da Silva and 
Hamidoune recently prove this conjecture. They used linear algebra and the 
representation theory of the symmetric group to show that 


|h* A| > min(p, hk — h? +1) 


for every h > 2. 

The purpose of this paper is to give a complete and elementary exposition 
of this proof. Instead of representation theory, we will use the combinatorics 
of the h-dimensional ballot numbers. 


2. Multi-dimensional Ballot Numbers 


The standard basis for R” is the set of vectors {e1,...,e,}, where 
e; = (1,0,0,0,...,0) 
eo = (0,1,0,0,...,0) 
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e, = (0,0,0,...,0,1). 


The lattice Z” is the subgroup of R” generated by the set {e1,...,e,}, so 
Z" is the set of vectors in R” with integral coordinates. Let 


a= (ao, @1,---+;@n—1) rae At 
and 
b = (bo, b1,..-,bn-1) € Z". 


A path in Z" is a finite sequence of lattice points 


a=V0,V1,---;Vm =b 
such that 
vj — vj-1 € {e1,...,en} 
for 7 = 1,...,m. Let vj;_1, vj; be successive points on a path. We call this a 


step in the direction e; if 
Vj =Vj-1 + &. 
The vector a is called nonnegative if a; > 0 fori =0,1,...,h —1. We write 
a<b 


if b — a is a nonnegative vector. 
Let P(a,b) denote the number of paths from a to b. The path function 
P(a,b) is translation invariant in the sense that 


P(a+c,b+c) = P(a,b) 
for all a,b,c € Z". In particular, 
P(a,b) = P(0,b-— a). 


The path function satisfies the boundary conditions 


P(a,a) =1, 
and 
P(a,b) > 0 if and only if a <b, 
If a= vo, V1,---;Vm =b is a path, then 
Vm_-1 =b-@e; 
for some i = 1,...,h, and there is a unique path from b — e;, to b. It follows 


that the path counting function P(a, b) also satisfies the difference equation 
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Let a <b. For: =0,1,...,k—1, every path from a to b contains exactly 
b; — a; steps in the direction e;4,. Let 


h-1 
i=0 


Every path from a to b has exactly m steps, and the number of different 
paths is the multinomial coefficient 
(Sizo (bi — a1)! ml 
P(a, b) _ =o seaneaT TETEE | : (1) 
Ls (d; = ai)! eae (d; =a ai)! 
Let h > 2. There are h candidates in an election. The candidates will be 
labelled by the integers 0,1,..., — 1. Suppose that mo votes have already 
been cast, and that candidate 7 has received a; votes. Then 


Mo = ao + G1 +++ + Gpn-1. 
We shall call 
Vo =a = (a0, @1,---,Qn-1) 


the initial ballot vector. There are m remaining voters, each of whom has one 
vote, and these votes will be cast sequentially. Let v;, denote the number of 
votes that candidate 7 has received after k additional votes have been cast. 
We represent the distribution of votes at step k by the ballot vector 


Vi = (U0,k Ul,ky +) UR—1,k)- 
Then 
Vok + U1 +++ + Unik =k + mo 
for k = 0,1,...,m. Let 
Ve = S Ba Bis chet) 


be the final ballot vector. It follows immediately from the definition of the 
ballot vectors that 


Ve — Ve-1 € {€1,..., en} 
fork =1,...,m, and so 
a=Vo0,V1,---;Vm =b 


is a path in Z’ from a to b. Therefore, the number of distinct sequences of m 
votes that can lead from the initial ballot vector a to the final ballot vector 
b is the multinomial coefficient 
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Let v = (v1,...,Un) and w = (wi,..., wn) be vectors in R”. The vector 
v will be called increasing if 


vy S v2 S++ Sp, 
and strictly increasing if 
Vy < V2 <<... < Uh. 
Now suppose that the initial ballot vector is 
a = (0,0,0,...,0) 


and that the final ballot vector b = (bo,b1,...,0,-1) is nonnegative and 
increasing. Let 


m = bo + by +--+ + bpn_1.- 


Let B(bo, b1,...,b6p-1) denote the number of ways that m votes can be cast 
so that all of the k-th ballot vectors are nonnegative and increasing. This is 
the classical h-dimensional ballot number. Observe that 


B(0,0,...,0)=1 


and 
B(bo, bi, say bn—1) >0 

if and only if (bo,b1,...,6n-1) is a nonnegative, increasing vector. These 
boundary conditions and the difference equation 

h-1 

B(bo, b1,...,0n-1) = > B(bo,..-,be—1, Bs — 1, Bist... Dn—1) 

i=0 

completely determine the function B(bo, b1,...,bp—1). 


There is an equivalent combinatorial problem. Suppose that the initial 
ballot vector is 


a* = (0,1,2,...,h—1), 
and that the final ballot vector 
b = (bo, b1,..., br—1) 


is nonnegative and strictly increasing. Let 


Let B(bo, b1,...,6,-1) denote the number of ways that m votes can be cast 
so that all of the ballot vectors v;, are nonnegative and strictly increasing. 
We shall call this the strict h-dimensional ballot number. 
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A path vo,vi,...,Vm in Z” will be called strictly increasing if every 
lattice point v;, on the path is strictly increasing. Then B(bo, bi1,...,bn—1) is 
the number of strictly increasing paths from a* to b = (bo,..., br—1). 


The strict h-dimensional ballot numbers satisfy the boundary conditions 
B(O,1,...,h-1)=1 
and 
B(bp, b1,-..,bn—1) > 0 


if and only if (bo,b1,...,b,-1) is a nonnegative, strictly increasing vector. 
These boundary conditions and the difference equation 


h-1 
Bhp digeres byt) = Yo Bboy. 55 Bete Dy pe oc DRE) 
i=0 


completely determine B(bo, by,..-,0n—1)- 
There is a simple relationship between the h-dimensional ballot numbers 
B(bo, 01,.--,bp—1) and B(bo, b1,...,bp-1). The lattice point 
Vv = (v0, U1,---;Uh-1) 
is nonnegative and strictly increasing if and only if the lattice point 
v =v-—(0,1,2,...,h-1)=v-—a* 
is nonnegative and increasing. It follows that 
a® =Vo,V1,V2,---,Vm =b 
is a path of strictly increasing vectors from a* to b if and only if 
0,v, —a*,vo—a"*,...,b—a* 
is a path of increasing vectors from 0 to b — a*. Thus, 
B(bo, b1, ..., bn—1) = B(bo, bi — 1, bo — 2,..., bn-1 — (h—1)). 


For 1 <i <j < Ah, let H;,; be the hyperplane in R” consisting of 
all vectors (1,...,@,) such that x; = x;. There are (3) such hyperplanes. 


A path 
a= Vo, V1, V2,---;Vm =b 


will be called intersecting if there exists at least one vector vz on the path 
such that v, € Hj,; for some hyperplane H;,;. 

The symmetric group S$; acts on R” as follows: For ¢ € S;, and v = 
(vp, V1,..-,Upn_1) € R*, let 


OV = (U9(0); Vo(1)) +++ 9 Vo(h—-1))- 


A path is intersecting if and only if there is a transposition tT = (7,7) € Sp 
such that tv; = vx for some lattice point vz on the path. 
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Let I(a,b) denote the number of intersecting paths from a to b. Let 
J(a,b) denote the number of paths from a to b that do not intersect any of 
the hyperplanes H;,;. Then 


P(a,b) = I(a,b) + J(a,b). 


Lemma 1. Let a be a lattice point in Z", and let b = (bo,...,bn—-1) be a 
strictly increasing lattice point in Z". A path from a to b is strictly increasing 
if and only if it intersects none of the hyperplanes H;,;, and so 


B(bo,...,6n—1) = J(a*,b). 
Proof. Let a= vo,V1,...,;Vm = b be a path, and-let 
Vk = (V0,k; V1,ky +++) Uh—1,k) 


for k = 0,1,...,m. If the path is strictly increasing, then every vector on 
the path is strictly increasing, and so the path does not intersect any of the 
hyperplanes H;,;. Conversely, if the path is not strictly increasing, then there 
exists a greatest integer k such that the lattice point vz_1 is not strictly 
increasing. Then 1 < k < m, and 


Uj,k-1 S Vj—-1,k-1 
for some j = 1,...,4—1. Since the vector vx is strictly increasing, we have 
Uj-1,k SUj,n — I. 
Since vz_1 and vz are successive vectors in a path, we have 
Uj—-1,k—-1 S Vj—1,k 
and 
jk — 1S Ujb-1- 


Combining these inequalities, we obtain 


Usj,k—-1 S Vj-1,k-1 S Uj-1,k S Uj — 1S vp p-1- 
This implies that 
Uj,k—-1 = Vj—-1,k-1 


and so the vector vz_; lies on the hyperplane H;_1,;. Therefore, if b is a 
strictly increasing vector, then a path from a to b is strictly increasing if and 
only if it is non-intersecting, and so J(a, b) is equal to the number of strictly 
increasing paths from a to b. It follows that J(a*,b) is equal to the strict 
ballot number B(bo,...,bn—1). This completes the proof. 


Lemma 2. Leta and b be strictly increasing vectors. Then 
P(oa,b) = I(aa,b) 
for every o € Sp, on F id. 
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Proof. If a is strictly increasing and if o € S;, o # id, then ga is not strictly 
increasing, and so every path from oa to b must intersect at least one of the 
hyperplanes H;;. This completes the proof. 


Lemma 3. Leta and b be strictly increasing lattice points. Then 


S© e(a)I(oa, b) = 0. 


o€ESh 


Proof. Since a is strictly increasing, it follows that there are h! distinct lattice 
points of the form aa, where o € Sp, and none of these lattice points lies on 
a hyperplane H;,;. Let © be the set of all intersecting paths that start at any 
one of the h! lattice points ca and end at b. We shall construct an involution 
from the set 2 to itself. 

Let o € Sp, and let 


0aA=V0,V1;---;Vm =b 


be a path that intersects at least one of the hyperplanes. Let k be the least 
integer such that v, € H;,; for some i < j. Then k > 1 since a is strictly 
increasing, and the hyperplane H;; is uniquely determined since v; lies on a 
path. Consider the transposition rT = (i, 7) € S;,. Then 


TVe = Ve © His, 
Toa #oa, 
and 
TOa = TV0,TV1,---;TVk = Vk; Vkt1;-+-;Vm =b 


is an intersecting path in Q from toa to b. Moreover, k is the least integer 
such that a lattice point on this new path lies in one of the hyperplanes, 
and H;,; is still the unique hyperplane containing v;,. Since 7? is the identity 
permutation for every transposition 7, it follows that if we apply the same 
mapping to this path from toa to b, we recover the original path from oa 
to b. Thus, this mapping is an involution on the set 2 of intersecting paths 
from the h! lattice points ca to b. Moreover, if o is an even (resp. odd) 
permutation, then an intersecting path from ga is sent to an intersecting 
path from roa, where 7 is a transposition and so ro is an odd (resp. even) 
permutation. It follows that the number of intersecting paths that start at 
even permutations of a is equal to the number of intersecting paths that start 
at odd permutations of a. This means that 


S> I(ca,b)= > I(ea,b). 


oeSh oESh 
e(a)=1 e(a)=—-1 


This statement is equivalent to the Lemma. 
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Let [x], denote the polynomial x(a — 1)---(a — k + 1). If b;,0(é) are 
nonnegative integers, then 


[Diloca) = bi(bi — 1) (bi — 2) --- (0; — o(@) + 1) 


7 jane —a(i))! if ofa) <b; 
0 if o(i) > by. 


Theorem 1. Let h > 2, and let bo, b1,...,bn—1 be integers such that 
0 < bo < by < +++ < bp. 
Then 


B(bo +61 + +++ + bp-1) = 


(bp + bi ++++ + bn—1 — (8))! Tl 


Bolby!--- bya! (bj — bi). 


0<i<j<h—-1 


Proof. Let a* = (0,1,2,...,2 — 1), and let b = (bo,01,...,0n,-1) € Z*. 
Applying the previous lemmas, we obtain 


B(bo, bi, --ba—1) = 
= J(a*,b) 
= P(a*,b) — I(a*,b) 


= P(a*,b) + S© e(a)I(ca*,b) 


a ss HA (bo +--+ + bn — (8))! 


265), Tio (bi — o(@))! 
aa*<b 
bo +-+-+ ba_i — (5)}! 
7 ( 5b bal ) Ye €(2) bolo) lea) **- [bh—t]e(n—1) 
0-91 h-1 7eS), 


ay — 
bb Be nal )) S© €(0)[bolo(oy Bilo(ay + ** [bn—alo(n—1) 
0-41: h—-1- oeSp, 
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1 [bo]: = [bol2_ +++ [bo]n—1 
— (bo tees + bp-1 — (B))E J 4 [Oa [baJ2 + [Jaa 
Sellar 
1 [bp—ala [bn—iJ2 «++ [bn—a]n—1 


7 (bo +--+ + dni — (8)! II (b; — b;). 


bo!lbi! +--+ bp_1! 
eon iat 0<i<j<h—1 


This completes the proof. 


We state the following corollary with the notation that is used later in 
the proof of the Erdés-Heilbronn conjecture. 


Corollary 1. Let h > 2, let p be a prime number, and let i9,11,...,tn—1 be 
integers such that 


O0<%9 <4 < +++ <ip_1 <p 
and 

to +1 tes tp-1 < 9 + p. 
Then 


B(io, i1,--.,tn-1) #0 (mod p). 


Proof. This follows immediately from the Theorem. 


3. A Review of Linear Algebra 


Let V be a finite-dimensional vector space over a field F’, and let T: V > V 
be a linear operator. Let J : V — V be the identity operator. For every 
nonnegative integer i, we define T* : V > V by 


T°(v) = I(v) =v, 
Pwyaae wy) 
for all v € V. To every polynomial 
p(t) = Cpt” + cp_1a""1 +--+ +e¢ +9 € Fiz] 
we associate the linear operator p(T) : V > V defined by 
P(T) = CrT” + Cp-1T” | +++ teat + ool. 


The set of all polynomials p(a) such that p(T) = 0 forms a nonzero, proper 
ideal J in the polynomial ring F[z]. Since every ideal in F'[2] is principal, 
there exists a unique monic polynomial pr(x) = pr,v(x) € J such that pr(x) 
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divides every other polynomial in J. This polynomial is called the minimal 
polynomial of T over the vector space V. 

A subspace W of V is called invariant with respect to T if T(W) C W, 
that is, if T(w) € W for all w € W. Then T restricted to the subspace W is a 
linear operator on W with minimal polynomial pr w (zx). Since pry (T)(w) = 
0 for all w € W, it follows that Pp w(x) divides pr y(x), and so 


deg(pr,w) < deg(pr.v), (2) 


where deg(p) denotes the degree of the polynomial p. 

For v € V, the cyclic subspace with respect to T generated by v is the 
smallest subspace of V that contains v and is invariant under the operator T’. 
We denote this subspace by Cr(v), Let v; = T*(v) for i = 0,1,2,.... Then 
Cr(v) is the subspace generated by the vectors 


{v,T(v),T?(v), T(v),...+ = {vo, V1, V2, v3,---$ 


and 
dim(Cr(v)) = l, 
where / is the smallest integer such the vectors vo,vi,...,Vv; are linearly 
dependent. This means that there exist scalars co,c1,...,¢cj—1 in the field F 
such that 
vi + c_1Vvi-1 +++ +¢1V1 + Covo = O. 
Let 
p(t) = #! + q_ya' 1 +---+e2+ ep. 
Then 
p(T)(vo) = T'(vo) + 1_-1T" 1 (vo) +++» + e1T (vo) + col (vo) 
=vit+c-1Vi-1 + +++ +C1V1 + CoVo 
= 0, 
and so 


p(T) (vi) = p(T)(T(vo)) = T'(p(T)(vo)) = T'(0) = 0 


for i = 0,1,2,.... Therefore, p(T’) = 0 on the cyclic subspace Cr(v) = C, 
and so p(x) is divisible by the minimal polynomial Pr.c(x), and 


m = deg(pr,c) < deg(p) = I. 
On the other hand, since 
pr,o(T)(v) = 9, 
it follows that the vectors vo, V1,..-,Vm are linearly dependent, and so 


l<m. 
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This implies that | = m and so, by inequality (2), 


dim(Cr(v)) = deg(pr,c) < deg(pr,v) 


for allv € V. 

If T(£) = af for some a € F and some nonzero vector f € V, then a is 
called an eigenvalue of T and f is called an eigenvector of T with eigenvalue a. 
The spectrum of T, denoted o(T), is the set of all eigenvalues of T. If V has 
a basis consisting entirely of eigenvectors of JT’, then T is called a diagonal 
operator. 

The following inequality plays a central role in the proof of the Erdés- 
Heilbronn conjecture. 


Lemma 4. Let T be a diagonal linear operator on a finite-dimensional vector 
space V, and let o(T) be the spectrum of T. Then 


dim(Cr(v)) < |o(7)| (3) 
for everyv EV. 


Proof. Let a € o(Tf), and let f be an eigenvector with eigenvalue a. Let W 
be the one-dimensional subspace generated by f. Then W is invariant with 
respect to T, and prw(x) = x —a. It follows that x — a divides pry (x), 


and so 
Il (x — a) 


a€o(T) 


divides pr,y(x). Let dim (V) =k. If T is a diagonal linear operator, then V 


has a basis {fo, fi,...,f,-1} of eigenvectors, and 
I] @- (fi) =0 
a€o(T) 


for i = 0,1,...,4—1. It follows that [[oe¢m(T — a)(v) = 0 for all v € V, 
and so 


Priv(£) = II (a — a). 
a€o(T) 


In particular, the degree of pry(x) is equal to the number of distinct 
eigenvalues of T. It follows that, if T is a diagonal operator on a finite- 
dimensional vector space V, then 


dim(Cr(v)) < deg(pr,v) = |o(T)| 
for every v € V. This completes the proof. 


Lemma 5. Let T: V > V be a linear operator on the vector space V, and 
let {fo,fi,...,f,-1} be eigenvectors of T with distinct eigenvalues. Let 


vo =fot+fit+---+fr-1, 
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and let Cr(vo) be the cyclic subspace generated by vo. Then 
dim(Cr(vo)) =k 

and 

{Vo, T(vo), T*(vo), mene T*~*(vo)} 
is a basis for Cr(vo). If dim(V) =k, then Cr(vo) =V. 
Proof. We first show that the vectors fo, f1,...,£,—1 are linearly independent. 
If they are linearly dependent, then there is a minimal subset of the vectors 
fo,...,f,-1 that is linearly dependent, say, fo,...,fj-1. Moreover, | > 2 since 


f; 40 for i=0,1,...,4—1. There exist nonzero scalars co, ¢1,...,¢Ci—-1 such 


that pe cif; = 0. Let a; € o((T)) be the eigenvalue corresponding to the 


eigenvector f;. Then 


l-1 l-1 l-1 
1=0 i=0 


i=0 
Since 
1-1 1-1 
Ss cjay—1f; = aj_1 ~ cif; = 0, 
i=0 i=0 
it follows that 
I-1 I-2 
S- ci(a; — a1) fi = S- ci(a; — a1) f; = 0, 
i=0 i=0 


which contradicts the minimality of 1, since c;(a; — aj-1) # 0 for i < 1 — 
1. Thus, the vectors fo,...,f,-1 are linearly independent, and span a k- 
dimensional subspace W of V. Moreover, W is an invariant subspace, since 
it has a basis of eigenvectors of T. Since 


vo =f, +---+f, € W, 
it follows that 
Cr(vo) CW 
and so 
dim(Cr(vo)) < dim(W) = k. 
The vector T*(vo) € Cr(vo) for every nonnegative integer 7. Since 
T'(vo) = apfo + aif, +.+>+a4_yfr—-1, 


the matrix of the set of vectors {vo,T(vo),T?(vo),..-,;T*~'(vo)} with 
respect to the basis {fo,...,f,-1} is 
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1 ao ag vee a, 
la, ajo: ay 
1 ag az tee a 


1 ag_-1 Cran ne py 
and its determinant is the Vandermonde determinant 


I] @-a) 40. 


O<i<j<h 


It follows that {vo,T(vo),T?(vo),--.,T*~1(vo)} is a set of linearly indepen- 
dent vectors, and so 


dim(Cr(vo)) > k = dim(W). 


Therefore, dim(Cr(vo)) = k. If dim(V) = &, then Cr(vo) = V. This 
completes the proof. 


4. Alternating Products 


Let A”’V denote the h-th alternating product of the vector space V. Then 
AV is a vector space whose elements are linear combinations of expressions 
of the form 


Vo Avi A+++ AVp-1; 
where vo, V1,---;Va—1 € V. These wedge products have the property that 
vo Avi A+++: AVpn_1 = 0 
if v; = vj; for some 7 # j, and 
Vo(0) A Vo(1) A+++ A Vo(n—1) = E(0)Vo Avi Avs? AVa-1 


for allo € Sp. 
If {e9,...,e4-1} is a basis for V, then a basis for \”V is the set of all 
wedge products of the form 


ei, Nei, Ao A Cina; 
where 
O<% <i, <<... < ind Sk 1, 


and 


dim(A’V) = B 
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Every linear operator T : V > V induces a linear operator 


h h 
pT: \v3AV 
that acts on wedge products according to the rule 


DT (Vip A+++ A Vin) = 
h-1 
> Vio Arse A Vin mx T(vi;) A Vijgn °° A Vin-1- (4) 
j=0 


The operator DT is called the derivative of T. 


Lemma 6. Let T be a diagonal linear operator on V, and let o(T) be the 
spectrum of T. For h > 2, let DT: rit Vo A" V be the derivative of T. If 
T has distinct eigenvalues, that is, if |o(T)| = dim(V), then 


o(DT) = h*o(T) 
and 
|h“o(T)| = dim(Cpr(w)) (5) 
for everyw Ee A"V. 


Proof. Let o(T’) = {ao,a1,...,@~—1}, and let {fo,f,...,f,-1} be a basis of 
eigenvectors of V such that T(f;) = aj;f; for i = 0,1,...,k —1. Then (4) 
implies that 


DT(f; \- 2a (linea) = (Gio Ble eee + din, )(Fio Avs A fis 24) 


It follows that DT is a diagonal linear operator on A”V, and its spectrum 
o(DT) consists of all sums of h distinct eigenvalues of T, that is, 


o(DT) = ho(T). 


Applying inequality (3) to the vector space A”V and the operator DT, we 
obtain 


|n“o(T)| = |o(DT)| = dim(Cpr(W)) 


for every w € A’V. This completes the proof. 


Theorem 2. Let T be a linear operator on the finite-dimensional vector 
space V. Let h > 2, and let DT : A°V > A"V be the derivative of T. 
For vo € V, define 


vV,= T'(vo) EV 
fori > 1, and let 


W=VoAvi A+++ AVn-1 € AV. 
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Then for every r > 0 
(DT)"(w) = (DT)"(vo Avi A+++ A Vn-1) 
= S> Blios in, ate th-1) Vig \ Vit Ars Vin 


where the sum is over all integer lattice points (ig,i1,...,in-1) € Z" such 
that 


O< ig <iy <>< ip Srt+h—-1 


and 

: ; : h 

to +t Fete + tp-1 = 9 +r, 
and where Blio, i1,-.-,tn—-1) ts the strict h-dimensional ballot number corre- 
sponding to the lattice point (io, %1,..-,%n—1). 


Proof. The proof will be by induction on r. Let r = 0. Since 
B(0,1,2,...,h-1) =1, 
we have 
(DT)°(w) =w 
=voAviA-:::-AVp-1 


= B(0,1,2,...,2-l)vo Avi A+++ AVaaa. 
Suppose the result holds for some integer r > 0. Then 
(DT)"**(w) = DT((DT)’(w)) 


= DT (~ Blio, i1, aa -5th—-1) Vio A Vit A+++ A Vina) 


= S> Blio, in, yh »tn-1) DT (Vig AVi, Act A Vin_1) 


= S" Bostiy-: site) aM Av ‘A Vijz-4 AT (viz) A Vijas Aves A Vi;_1) 
j=0 


= De BG os tive ->¢n—1) (Vig A+ “A Vijz-4 AN Vigaa AN Viza1 A+ ‘A Vi,_1) 


= ) C(io,21,---,tn—1)Vig A Viz Ao+* A Ving 


where the last sum is over all integer lattice points (to, %1,...,ipn-1) € Z” 
such that 


O<i9 <4 < +++ <tpn-i<rth 
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and 
ott tes +tp-1 = 9 +r-+1, 


and the integer C(io,71,...,in—1) satisfies the difference equation 
h-1 
C(iostay+-+ytn—1) = > Blt, ---54j-154j — 1, 4j44,--+ 44-1) 
j=0 


This difference equation determines the strict h-dimensional ballot numbers, 
and so 


C(io,41,---;4n-1) = B(to, t1,---,¢n—1)- 


Therefore, the result holds in the case r+ 1. This completes the induction. 


5. Erd6os-Heilbronn, Concluded 
Theorem 3. Let p be a prime number, and let A C Z/pZ, where |A| = k. 
Let2<h<k. Then 

|p“ A| > min(p, hk — h? +1). 


Proof. Let A = {ao,a1,...,@%—-1}. Let V be a vector space of dimension k 
over the field Z/pZ, and let {fo, fi,...,f;-1} be a basis for V. We define the 
diagonal linear operator T : V > V by 


T(£;) = aif; 
for 7 =0,1,...,k—1. The spectrum of T is 
o(T) =A. 
Let 
vo =fotfit---+ fri, 
and define 


Vi4-1 = T (vi) = T’(vo) 


for 1 > 0. By Lemma 5, the cyclic subspace generated by vo is V, and the set 
of vectors {Vo, V1,---,Vk—1} is a basis for V. The alternating product ie V 
is a vector space with a basis consisting of the (°) wedge products of the 
form 


Vio A Vix Ave A Vin_a> 
where 


O<ig9 < iy << tp_y < K-1. 
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Let 
h 
w=voAviA-:-Avni € AV. 


By inequality (5), 
|h\A| = |o(DT)| > dim Cpr(w). 
Therefore, it suffices to prove that 
dim Cpr(w) > min(p, hk — h? +1). 
This is equivalent to proving that the vectors 
w, (DT)(w), (DT)?(w),...,(DT)"(w) 
are linearly independent in the alternating product i V, where 
n = min(p, hk — h? +1) —1 = min(p— 1, hk — h?). 


Let 0<r<n. By Theorem 2, the vector (DT)"(w) is a linear combination 
of the vectors 


Vio A Vix Ar++AN\Vin_y 


such that 
O<i9 <iy <<... <in_y <rt+h—-1 
and 
. ; . h 
yy ters +tp-1 = 2 +r. (6) 


Let I be the interval of integers [0, k — 1]. Since 


rAT = (3) ae we) = i) + [0, hk — A”), 


it follows that there is at least one basis vector vi, A vi, A+++ A Vin in the 
expansion of (DT')"(w) such that 


O<ig <ty < +++ Sind Sk -1<k<p 


; h h h 
to Fay +++ +tp-1 = 9 tres 9 +n< 9 + p. 


By Theorem 2, the coefficient of this basis vector is the strict h-dimensional 
ballot number B(io, i1,...,%,—-1) and 


and 


B(io,t1,.--,tn-1) #0 (mod 7) 
by Corollary 1. 
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Suppose that the vector v;, Avi, A: --AVin, satisfies 1; > k for some !. Since 
every vector v; € V with / > k is a linear combination of vo, v1,...,V,—1, it 
follows that v;, A vi, A+++: A Vin, is a linear combination of basis vectors of 
the form vj, A vj, A-+: A Vj,_,, where 


O< jo<ji<-++<jn-1<k-1 
and 

: : . h 

Jot ji tes+ + Jn-1 < 9 +r. 


It follows that (DT)"(w) is a linear combination of basis vectors vi, A vi, A 
--+Av;,_, Such that either 


h 
dott +e +1 < (5) +r 
or 
: . ; h 
to +t Fete + tp-1 = 9 +r, 


and in the latter case the basis vector appears with a coefficient that is 
nonzero modulo p. This implies that the vectors w, (DT)(w),...,(DT)"(w) 
are linearly independent in the cyclic subspace Cpr(w), and the proof of the 
Erdos-Heilbronn conjecture is complete. 


6. Remarks 


This proof only requires that A be a subset of a field, and does not require 
that the field be Z/pZ. Let F be an arbitrary field. Let p be the characteristic 
of F' if the characteristic is positive, and let p = oo if the characteristic is 
zero. Then we have, in fact, proved that if A C F and |A| = k < p, then 
|h’ A| > min(p, hk — h? +1) for all h > 1. 

The Cauchy-Davenport theorem was proved by Cauchy [3] in 1813. 
Davenport [5] rediscovered the result in 1935. I. Chowla [4] immediately 
extended the Cauchy-Davenport theorem to composite moduli. Other gener- 
alizations have been obtained by Pillai [14], Shatrovskii [20], Pollard [15, 16], 
Brakemaier [2], and Hamidoune [12]. Davenport [6] discovered in 1947 that 
Cauchy had proved the Cauchy-Davenport theorem first. 

The Erd6s-Heilbronn conjecture originated in the 1960s. Partial results on 
the Erdés-Heilbronn conjecture were obtained by Rickert [18], Mansfield [13], 
Rédseth [19], Pyber [17], and Freiman, Low, and Pitman [11]. Dias da Silva 
and Hamidoune [8] proved the conjecture by using results from representation 
theory and linear algebra. This algebraic technique had previously been 
applied to additive number theory by Dias da Silva and Hamidoune [7] and 
Spigler [21]. 
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The derivation of the formula for the strict ballot number B (bo,---;On—1) 


follows a paper of Zeilberger [22]. 


Using combinatorial ideas from the proof of Dias da Silva and Hamidoune, 


Alon, Nathanson, and Ruzsa [1] found a different proof of the Erdés- 
Heilbronn conjecture that uses only the simplest properties of polynomials. 
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1. Introduction 


Let $;, be the symmetric group of n letters. Landau considered the function 
g(n) defined as the maximal order of an element of S,,; Landau observed that 
(cf. [9]) 


g(n) = max lem(m1,..., Mx) (1) 


where the maximum is taken on all the partitions n = m, +m2+---+m sz 
of n and proved that, when n tends to infinity 


log g(n) ~ \/nlogn. (2) 


More precise asymptotic estimates have been given in [11, 22, 25]. In [25] and 
[11] one also can find asymptotic estimates for the number of prime factors of 
g(n). In [8] and [3], the largest prime factor P*(g(n)) of g(n) is investigated. 
In [10] and [12], effective upper and lower bounds of g(n) are given. In [17], it 
is proved that limps. g(n+1)/g(n) = 1. An algorithm able to calculate g(n) 
up to 10/° is given in [2] (see also [26]). The sequence of distinct values of 
g(n) is entry A002809 of [24]. A nice survey paper was written by W. Miller 
in 1987 (cf. [13]). 

My very first mathematical paper [15] was about Landau’s function, and 
the main result was that g(n), which is obviously non decreasing, is constant 
on arbitrarily long intervals (cf. also [16]). I first met A. Schinzel in Paris in 
May 1967. He told me that he was interested in my results, but that P. Erdés 
would be more interested than himself. Then I wrote my first letter to Paul 
with a copy of my work. I received an answer dated of June 12 1967 saying “I 
sometimes thought about g(n) but my results were very much less complete 
than yours”. Afterwards, I met my advisor, the late Professor Pisot, who, in 
view of this letter, told me that my work was good for a thesis. 

The main idea of my work about g(n) was to use the tools introduced 
by S. Ramanujan to study highly composite numbers (cf. [19, 20]). P. Erdés 
was very well aware of this paper of Ramanujan (cf. [1, 4-6]) as well as of 
the symmetric group and the order of its elements, (cf. [7]) and I think that 
he enjoyed the connection between these two areas of mathematics. Anyway, 
since these first letters, we had many occasions to discuss Landau’s function. 
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Let us define ny = 1, no = 2, n3 = 38, ng = 4, N5 = 5, NG = 7, ete. ..., Nx 
(see a table of g(n) in [16, p. 187]), such that 
g(x) > g(r — 1). (3) 


The above mentioned result can be read: 

lim(ng41 — Ne) = +00. (4) 
Here, I shall prove the following result: 
Theorem 1. 


lim(ng41 — Ne) < +00. (5) 


Let us set py = 2,p2 = 3,p3 = 5,..., pe = the k-th prime. It is easy to 
deduce Theorem 1 from the twin prime conjecture (i.e. lim(pz41 — pr) = 2) or 
even from the weaker conjecture lim(p,41—pr) < +00. (ef. Sect. 4 below.) But 
I shall prove Theorem 1 independently of these deep conjectures. Moreover I 
shall explain below why it is reasonable to conjecture that the mean value of 
Nk+1 — Ny is 2; in other terms one may conjecture that 


and that np41 — nx = 2 has infinitely many solutions. Due to a parity 
phenomenon, nz41 — nz seems to be much more often even than odd; 
nevertheless, I conjecture that: 


lim(ng4 — ny) = 1. (7) 


The steps of the proof of Theorem 1 are first to construct the set G 
of values of g(n) corresponding to the so called superior highly composite 
numbers introduced by S. Ramanujan, and then, when g(n) € G, to build 
the table of g(n+d) when d is small. This will be done in Sects. 4 and 5. Such 
values of g(n +d) will be linked with the number of distinct differences of the 
form P—Q where P and Q are primes satisfying r—r7* << Q<a< P< 242%, 
where x goes to infinity and 0 < a < 1. Our guess is that these differences 
P—Q represent almost all even numbers between 0 and 22°, but we shall 
only prove in Sect. 3 that the number of these differences is of the order of 
magnitude of x, under certain strong hypothesis on x and a, and for that a 
result due to Selberg about the primes between x and x + 2° will be needed 
(cf. Sect. 2). 

To support conjecture (6), I think that what has been done here with 
g(n) € G can also be done for many more values of g(n), but, unfortunately, 
even assuming strong hypotheses, I do not see for the moment how to 
manage it. 

I thank very much E. Fouvry who gave me the proof of Proposition 2. 
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1.1 Notation 


p will denote a generic prime, pz the k-th prime; P,Q, P;,Q; will also denote 
primes. As usual 7(x) = )7,,<, 1 is the number of primes up to 2. 

|S| will denote the number of elements of the set S. The sequence ny is 
defined by (3). 


2. About the Distribution of Primes 


Proposition 1. Let us define r(x) = ae 1, and let a be such that é < 
a<1, ande>0. When E goes to infinity, and &' = € + E/log€, then for all 
ax in the interval [€,€'] but a subset of measure O((€’ — €)/log® €) we have: 


a ge re 
m(a +a°) — r(x) pee nd ipsa (8) 
a x A 
n(x) — a(a — 2) ipa < ae (9) 
zr Qk —Q*r1 Ja . 
S ll dk>2. 1 
log x log Q log? x for all primes Q, and k > (10) 


Proof. This proposition is an easy extension of a result of Selberg (cf. [21]) 
who proved that (8) holds for most x in (&, €’). In [18], I gave a first extension 
of Selberg’s result by proving that (8) and (9) hold simultaneously for all x in 
(€,€’) but for a subset of measure O((€’ — €)/ log? ). So, it suffices to prove 
that the measure of the set of values of x in (€,&') for which (10) does not 
hold is O((€' — €)/ log? €). 

We first count the number of primes @ such that for one k we have: 


k _ ak-l / 
eo, lean aia oe 
log& — log Q ~ log €’ 
If Q satisfies (11), then k < 8 for €! large enough. Further, for k fixed, 


log 


(11) implies that Q < (€’)!/*, od the total number of solutions of (11) is 


(11) 


log €’/ log 2 
< eM =e) =O. 
k=2 
With a more careful estimation, this upper bound could be improved, but this 


k k-1 
crude result is enough for our purpose. Now, for all values of y = eo ae 


satisfying (11), we cross out the interval (y pate was . We also 


The total sum of 


cross out this interval whenever y = me and y = me er 


the lengths of the crossed out intervals is O (aS): which is smaller than 
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the length of the interval (ck. x) and if ina does not fall into one of 
these forbidden intervals, (10) will certainly hold. Since the derivative of the 


function p(x) = 2/ log is y/(x) = nee — ay and satisfies y' (x) ~ cee for 


all x € (€,€’), the measure of the set of values of x € (€,€’) such that y(a) 
falls into one of the above forbidden intervals is, by the mean value theorem 


O (Sz); and the proof of Proposition 1 is completed. 


3. About the Differences Between Primes 
Proposition 2. Suppose that there exists a,0 <<a<1, and x large enough 
such that the inequalities 

m(a+a°) —r(x) > (1—e)a*/loga (12) 

n(x) —m(a@— 2%) > (1—e)ar*/loga (13) 
hold. Then the set 

E = E(a,a) = {P—Q;P,Q primes, r-—a* <Q<a<P<a+a} 
satisfies: 
|E| > Coa 


where Cz = Cya*(1 —¢)* and C, is an absolute constant (C, = 0.00164 
works). 


Proof. The proof is a classical application of the sieve method that Paul 
Erdés enjoys very much. Let us set, for d < 2x%, 


rid) =[{(P,Q);a-a2° <Q<a<P<a+e2°,P—Q=d}|. 
Clearly we have 
JEI= Sool (14) 
0<d<2x% 
r(d)A0 
and 
S- r(d) = (x(a + 2%) — n(x))(4(x) — (a — &%)) > (1 — €)?2?*/ log? x. 
0<d<2a% 
(15) 
Now to get an upper bound for r(d), we sift the set 
A={njx-—a* <n<a} 
with the primes p < z. If p divides d, we cross out the n’s satisfying n = 0 
(mod p), and if p does not divide d, the n’s satisfying 


n=0 (modp) or n=-—d (mod p) 
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so that we set for p < z: 


1 if p divides d 
w(p) = 


2 if p does not divide d. 


By applying the large sieve (cf. [14, Corollary 1]), we have 


| A] 
r(d) < ne 


with 


L(z)= 5° (1 + Snlaltz) u(r)? | a 


zo pin 2 WP) 


(1 is the Mobius function), and with the choice z = (3|A|)*/?, it is proved in 


[23] that 
Al : wap Ue 
L@ = atl (: (p— 7) log*(IAl) sig P~ 2 


p>2 


The value of the above infinite product is 0.6602... < 2/3. We set f(d) = 
I] Pld a 5, and we observe that |A| > 2° — 1, so that for x large enough 
p> 


3 f (4). (16) 


Now, for the next step, we shall need an upper bound for >>,,<, f?(n). By 
using the convolution method and defining ~ 


=> e@ sf f?(n/a) 


a|n 


one gets h(2) = h(2?) = h(23) = ... = 0 and, for p > 3, A(p) = Ceay, 
hie = ip) =... = 0, so that 


ae PP(n) ~ a D nti hla) = ae h(a) =| 
= Ma) _ p= 3 
coy MOTT (+) an) 
8 
= 2.63985...%< 3 


From (15) and (16), one can deduce 


ors Ysa LD te 
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which implies 


3a7274(1 — ¢)? 
d) > ——.—__—.. 
| 
0<d<2x 
r(d)AO0 


By Cauchy-Schwarz’s inequality, one has 


(ECS, fe) = nara 


0<d<20% 0<d<2a0% 
r(d)A0 r(d)A0 
and, by (14) and (17) 
4,,4a 1 4 38a/1 _ -\4 
ae (1 -«) Doig as a0. aie 
1,024| Al? 3 16,384 | A? 


Since |A| < #* + 1, and x has been supposed large enough, Proposition 2 is 
proved. 


4. Some Properties of g(n) 


Here, we recall some known properties of g(n) which can be found for instance 
in [16]. Let us define the arithmetic function ¢ in the following way: ¢ is 
additive, and, if p is a prime and k > 1, then ¢(p”) = p*. It is not difficult to 
deduce from (1) (ef. [13] or [16]) that 


= M. 1 
g(n) om (18) 


Now the relation (cf. [16, p. 139]) 

Meég(N) = (M’>M = &(M’)>&(M)) (19) 
easily follows from (18), and shows that the values of the Landau function 
g are the “champions” for the small values of @. So the methods introduced 
by Ramanujan (cf. [19]) to study highly composite numbers can also be used 


for g(n). Indeed M is highly composite, if it is a “champion” for the divisor 
function d, that is to say if 


M'<M => d(M’)<d(M). 


Corresponding to the so-called superior highly composite numbers, one 
introduces the set G: N € G if there exists p > 0 such that 


YM >1, ¢(M)—-plogM > &(N) — plogN. (20) 


Equations (19) and (20) easily imply that G C g(N). Moreover, if p > 2/ log 2, 
let us define x > 4 such that p = x/logax and 


Np = [[ vp” =] [” (21) 


pXa p 
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with 
0) if p>x 
Ap = 1 if beg Ses ree 
k>2 if So <p< te 


then N, € G. With the above definition, since x > 4, it is not difficult to 
show that (cf. [11, (5)]) 

pr <a (22) 
holds for p < x, whence N, is a divisor of the least common multiple of the 


integers < x. Here we can prove 


Proposition 3. For every prime p, there exists n such that the largest prime 
factor of g(n) is equal to p. 


Proof. We have g(2) = 2,9(3) = 3. If p > 5, let us choose p = p/logp > 
2/log 2. N, defined by (21) belongs to G C g(N), and its largest prime factor 
is p, which proves Proposition 3. 


From Proposition 3, it is easy to deduce a proof of Theorem 1, under the 
twin prime conjecture. Let P = p+ 2 be twin primes, and n such that the 
largest prime factor of g(n) is p. The sequence nz being defined by (3), we 
define k in terms of n by ng, <n < ng41, 80 that g(n~) = g(n) has its largest 
prime factor equal to p. Now, from (18) and (19), 


&(g(nk)) = rk 
and g(n, +2) > g(nx) since M = Eg(nx) satisfies M > g(n,) and 
2(M) = np + 2. So nezi < ne +2, and Theorem 1 is proved under this 
strong hypothesis. 
Let us introduce now the so-called benefit method. For a fixed p > 2/ log 2, 
N = N, is defined by (21), and for any integer M, 


M =|[p*, 
v7) 


one defines the benefit of WM: 


ben(M) = £(M) — €(N) — plog M/N. (23) 
Clearly, from (20), ben(M) > 0 holds, and from the additivity of @ one has 
ben(M) = >> (€(p*") — £(p*”) — p(Bp — ap) logp) . (24) 
p 


In the above formula, let us observe that &(p°) = p® if 8 > 1, but that 
&(p?) = 0 4 p® = 1 if B = 0, and, due to the choice of ap in (21), that, in 
the sum (24), all the terms are non negative: for all p and for 3 > 0, we have 


£(p°) — £(p®) — p(B — ap) log p > 0. (25) 
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Indeed, let us consider the set of points (0,0) and (3, p? log p) for 8 integer 
> 1. For all p, the piecewise linear curve going through these points is convex, 
and for a given p, a, is chosen so that the straight line L of slope p going 


through (ap, Ee) does not cut that curve. The left-hand side of (25), (which 


is ben(Np®~°?)) can be seen as the product of log p by the vertical distance 
of the point (3. ) to the straight line L, and because of convexity, we 
shall have for all p, 
ben(Np') >tben(Np), t>1 (26) 
and for p < a, 
ben(Np~*) >tben(Np~'), 1<t< ay. (27) 


5. Proof of Theorem 1 


First the following proposition will be proved: 


Proposition 4. Let a < 1/2, and x large enough such that (10) holds. Let 
us denote the primes surrounding x by: 

i <Q <0. <Q < QS t< P< BR<...<B<.... 
Let us define p = x/logz,N = N, by (21), n = &(N). Then forn<m< 
n+ 22%, g(m) can be written 
PP dP 
Qj, Qj. a Q5,, 


with r > 0 and ty <... <tr, ji <... < jr, Pi, Sv +4a%,Q5, > x — 4%. 


g(m) =N (28) 


Proof. First, from (18), one has ¢(g(m)) < m, and from (23) and (18) 


ben(g(m)) = &(g(m)) — &(V) ~ plog 22) 
forn <m < 22%. 

Further, let Q < x be a prime, and k = ag > 1 the exponent of Q in 
the standard factorization of N. Let us suppose that for a fixed m, @ divides 
g(m) with the exponent Bg =k+t,t > 0. Then, from (24), (25), and (26), 
one gets 


<m—n< 22% (29) 


ben(g(m)) > ben(NQ*) > ben(NQ) (30) 
and 


ben(VQ) = Q**? — Q* — plogQ 


_ QR —Q* 
ete (a -»). 
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From (21), the above parenthesis is nonnegative, and from (10), one gets: 


ben(NQ) > log 2 mG , (31) 
log” x 


For x large enough, there is a contradiction between (29), (30) and (31), and 
so, Bg < ag. 
Similarly, let us suppose Q < 7, k =ag >2 and 6g =k—-t,1<t<hk. 
One has, from (24), (25) and (27), 
ben(g(m)) > ben(NQ~*) > ben(NQ™') 
and 
ben(NQ7*) = Q*-! — Q* + plogQ 
Qt _ QR} 
log Q 


which contradicts (29), and so, for such a Q, bg = ag. 
Now, let us suppose Q < z,ag = 1, and Bg = 0 for some m,n < m< 
n+ 2%. Then 


ben(g(m)) > ben(NQ~') = —Q + plogQ = y(Q) 


by setting y(t) = plogt — t. From the concavity of y(t) for t > 0, for x > e?, 
we get 


y(Q) = y(a) + (Q—2)y'(z) = (Q- 2) (¢ = 1) 


JE 


log! x 


= 1089 (» ) 2 t0e2 


and so, 


which, from (29) yields 
r—-Q< 4a. 


In conclusion, the only prime factors allowed in the denominator of arn) are 
the Q’s, with x — 4%° <Q <a, andag = 1. 

What about the numerator? Let P > x be a prime number and suppose 
that P' divides g(m) with t > 2. Then, from (26) and (23), 


ben(Np') > ben(Np?) = P? — 2p log P. 
But the function t +> t? — 2plogt is increasing for t > \/p,; so that, 
ben(N P*) > 2? — 2x > 22% 
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for « large enough, which contradicts (29). The only possibility is that P 
divides g(m) with exponent 1. In that case, from the convexity of the function 
z(t) =t — plogt, inequality (26) yields 


ben(g(m)) > ben(NP) = 2(P) > 2(x) + (P — z)2"(z) 
= (P-2) (1-2) 3 P-o 


for x > e?, which, with (29), implies 


P—«< 4a. 


Up to now, we have shown that 


Qj, --- Qi, 
with Pi. < «#+42°,Q;, > x — 4x%. It remains to show that r = s. First, 
since n <m <n-+ 2x%, and N belongs to G, we have from (18) and (19) 


n < l(g(m)) < n+ 2a. (32) 
Further, 


t=1 t=1 
and since r < 4a, and s < 42°, 
l(g(m)) —n < r(a + 4%) — s(a — 42%) 
<(r—s)x + 322°, 


From (32), €(g(m)) —n > 0 holds and as a < 1/2, this implies that r > s for 
x large enough. Similarly, 


E(g(m)) —n > (r—s)a, 


so, from (32), (r — s)a must be < 2x, which, for x large enough, implies 
r <s; finally r = s, and the proof of Proposition 4 is completed. 


Lemma 1. Let x be a positive real number, a,,a2,...,dx,61,b2,...,b% be 
real numbers such that 


by < bey <1. SDL <a <a, <a< 


< ar 


and A be defined by A= Sa; —b;). Then the following inequalities 


cr+taA ae thy A 
<|[-< = 
x 4 + < exp (2) 


hold. 
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Proof. It is easy, and can be found in [16, p. 159]. 


Now it is time to prove Theorem 1. With the notation and hypothesis of 
Proposition 4, let us denote by B the set of integers M of the form 
P,, Pi, ... PB; 
M = N41? 
Qi Qin --- Qi, 
satisfying 


&(M) — &(N) = S°(P;, — Qj.) < 20%. 


t=1 
From Proposition 4, for n <_m < 2a, g(m) € B, and thus, from (18), 


= M. 33 
g(m) Poe (33) 
MeB 


Further, for 0 < d < 22%, define 
Ba={Me€B: ¢(M) — &(N) = d}. 


I claim that, if d < d’ (which implies d < d' — 2), any element of Ba is smaller 
than any element of By. Indeed, let M € Bg, and M’ € By. From Lemma 1, 
one has 


M' 5 ttd lS etdt2 
er x , 
Since d < 2x% < x, and e a See cear one gets 


Me 1 x 
N7~ Sioge ea 


This last quantity is smaller than fue if (d +1)? < 2x +1, which is true 
for « large enough, because d < 2”% and a < 1/2. 

From the preceding claim, and from (33), it follows that, if By is non 
empty, then 


g(n+ d) = max By. 
Further, since N € G, we know that n = (NV) belongs to the sequence (n,) 


where g is increasing, and so, n = ng. If 0 < di < dg <... < dg < 2x% 
denote the values of d for which By is non empty, then one has 


Nig, = O+0,,1 Si < 5, (34) 


Suppose now that a < 1/2 and « have been chosen in such a way that 
(12) and (13) hold. With the notation of Proposition 2, the set E(x,a) is 
certainly included in the set {d1,d2,...,d;}, and from Proposition 2, 


s > Coxr® (35) 
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which implies that for at least one 2, dj41 —di < ee and thus 


Nkotit1 — Nkoti S Z- 
C2 


Finally, for é <a< 5, Proposition 1 allows us to choose x as wished, and 
thus, the proof of Theorem 1 is completed. 

With € very small, and a close to 1/2, the values of Ci and C2 given in 
Proposition 2 yield that for infinitely many k’s, 


Nk+1 — Nk < 20,000. 


To count how many such differences we get, we define 
y(n) = Card{m <n: g(m) > g(m-— 1)}. 


Therefore, with the notation (3), we have ny.) =n. 
In [16, 162-164], it is proved that 


where 7 is such that the sequence of consecutive primes satisfies pj41 — pi 
<p. Without any hypothesis, the best known 7 is > 1/2. 


Proposition 5. We have y(n) > n°/4~€ for all e > 0, and n large enough. 
Proof. With the definition of y(n), (34) and (35) give 
y(n + 2a°) — 7(n) = 8 > 2 (36) 


whenever n = £(N), N = N,,p = 2/logza, and z satisfies Proposition 1. But, 
from (21), two close enough distinct values of x can yield the same N. 

Tnow claim that, with the notation of Proposition 1, the number of primes 
p;, between € and €’ such that there is at least one x € [p;,pi4i) satisfying 
(8), (9) and (10) is bigger than $((é’) — 1(€)). Indeed, for each i for which 
[pi, Pi+1) does not contain any such x, we get a measure pj41 — p; > 2, and if 
there are more than 4(7(€’)—7(€)) such is, the total measure will be greater 
than 7(€’) — 1(€) ~ €/ log? €, which contradicts Proposition 1. 

From the above claim, there will be at least $(m(€’) — 1(€)) distinct N’s, 
with N = N,,p = 2/logaz, and € < « < €’. Moreover, for two such distinct 
N, say N’ < N”, we have from (21), €(N”) — €(N’) > €. 

Let N@ and N©) the biggest and the smallest of these N’s, and n@ = 
(NO), n© = e(N)), then from (36), 


1 
rate tan 


y(n) > y(n) = y(n) > eS et 
log* € 


(m(6") — m(§))E 
But from (21) and (22), « ~ log N,, and from (2), 


x~logN,~ Vnlogn with n=£(N,) 


(37) 
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sO 


én~ Vn® log n@® log n@) 


and since a can be choosen in (37) as close as wished of 1/2, this completes 


the proof of Proposition 5. 
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Dedicated to Paul Erdés on the occasion of his 80th 
birthday 


1. Introduction 


Our first joint paper with Erdés appeared in 1966. It was a triple paper with 
Szemerédi written on divisibility properties of sequences of integers which 
is one of Erdés’ favorite subjects. Nine further triple papers written on the 
same subject followed it, and since 1966, we have written altogether 52 joint 
papers with Erdés. On this special occasion I would like to return to the 
subject of our very first paper. In Sect. 2, I will give a survey of the related 
results, while in Sect. 3, I will study a further related problem. 


2. Survey of the Results on Divisibility Properties 
of Sequences of Integers 


Throughout this paper, the following notations will be used: N denotes the 
set of the positive integers. c,,c2,... denote positive absolute constants. If 
f(x) = 0(g(x)), then we write f(x) < g(x). If ACN, n EN, then we write 


s(A,n) = S- 


ac€A,a<n 
1 
HAn)= DP ao 
ac A,2<a<n 8 
1 
s(A) = = 
acA 
and 
1 
Ay ds loga 
ac€A,2<a 8 
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For n > 3 we put 
&(A,n) = s(A,n)(logn)~* 
and 
m(A, n) = t(A,n)(loglogn)~* 


These quantities can be called as “logarithmic density” and “log log density” 
of A. In fact, both of them can be considered as densities since if the infinite 
sequence A C N possesses asymptotic density, then the limit of both ¢(.A,n) 
and m(A,n) is equal to the asymptotic density of A. 

If A Cc N and there are no a, a’ witha € A, a’ € A,a#a’ and ala’, then 
A is said to be primitive. 

In 1935, Behrend [6] proved the following result: 


Theorem 1. [fn € N,n > 70,A C {1,2,...,n} and A is primitive, then 
we have 
£(A,n) < cx (loglogn)~'/?. 

The proof of this nice theorem is of a combinatorial nature; the crucial 
tool in the proof is Sperner’s theorem on subsets of finite sets. So that it 
is an Erd6s-type result, except that it is due to Behrend and not to Erdés; 
however, it is just a matter of a few months that now this result is known as 
Behrend’s theorem and not as Erdés’ theorem. The story is the following: 

Due to the steadily worsening political atmosphere in Hungary in the 
early thirties, Erddés was forced to leave the country in 1934. He took a train 
for Cambridge, and during the travel he felt very depressed since he had to 
leave all his relatives and friends for the unknown. Thus to cheer himself up, 
he started to do some mathematics. After a while, he ended up with the result 
formulated in Theorem 1. Arriving to Cambridge, Davenport and Radé were 
waiting for him at the station. Erdés told them his new result immediately. 
They said that, indeed, it was a nice result but unfortunately, Behrend had 
proved it a few months earlier. Erdés [9] consoled himself soon by proving 
a series of important new results in combinatorial number theory. Moreover, 
he had another result on primitive sets not anticipated by anyone. Indeed, he 
proved that for the “log log density” of a primitive set better estimate can 
be given, than for the “logarithmic density”: 


Theorem 2. [fn EN, n> 79, AC {1,2,...,n} and A is primitive, then 
we have 


t(A,n) < ce 
(so that m(A,n) < co(loglogn)~"). 


This theorem might suggest that Theorem 1 can be improved. This is not 
so as Pillai [28] pointed out: 
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Theorem 3. Jfn € N, n > no, then there is a primitive set A such that 
Ac {1,2,...,n} and 
£(A,n) > ¢3(loglogn)~/?. 


If A C N is an infinite sequence of positive upper logarithmic density, 
then it cannot be primitive by Theorem 1. Davenport and Erdés [8] proved 
that having this assumption, more can be said: 


Theorem 4. [f ACN is an infinite set such that 
im sup (A, n) > 0, 
then A contains an infinite subsequence a1,d2,... such that a;|aj41 for i = 
ie rere 
For n > 3, write 
G(n) = max £(A, n)(log log n)!/? 


where the maximum is taken over all primitive sets AC {1,2,...,n}. Then 
by Theorems 1 and 3, for n > no we have 


ca < G(N) < cs. 
In 1948, Erdés [10] proved a result which implies that for n > no(e) we have 
G(n) > (2n)-1/? —. 
On the other hand, Anderson [2] showed that for n > no(e) we have 
Gin<a" +e 
In 1967, Erdés, Szemerédi and I [20] determined the best possible value of 
the constants in Theorems | and 3 by proving that we have 
G(n) = (1 + 0(1))(2m)-¥?. 
Moreover, in [21] we showed that for infinite sets, Theorem 1 can be improved: 


Theorem 5. If A CN is an infinite primitive set, then for n + +00 we 
have 


&(A, n) = o((log log n)~ 1/2). 


Alexander [1] and Erdés, Szemerédi and I [17] sharpened Theorem 4 in 
various directions. In [18] and [19], Erdés, Szemerédi and I studied the number 
of divisibility relations up to a certain bound, i.e., the function 


f(A,n) = I{(@,@') :a,a" € A,ala’,a’ < n}]. 


In [22], we gave a survey of all these results and our other related works. This 
survey paper contained many unsolved problems. The two most interesting 
problems are, perhaps, the following ones: 
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Problem 1. Is it true that if a,,a2,... are real numbers with 1 < ay < 
dg <---, and fori,j,k EN,i #7 we have 


\a; = ka,| > 1; 


then 


1 
( » = )dogn)-* < c@(log log n)~1/? 


and 


(This would generalize Theorems 1 and 2, respectively.) 


Problem 2. Is it true that for all « > 0 there is a K = K(e) such that if 
ACN and A is primitive, then we have 


Haight [24], resp. Erdés and Zhang Zhenxiang [23] have certain partial 
results, but in their original form both problems are unsolved yet. 

Pomerance and Sarkdézy [29] studied the following problem: if I(A,n) is 
large enough to ensure the existence of pairs a,a’ with a,a’ € A,a 4 a’, 
ala’, then what can be said about the arithmetic properties of the quotients 
a’ /a (with ala’)? If P is a finite set of prime numbers, A C N and there are 
no a,a’ witha € A, a’ € A,a 4a’, ala’ and | II p, then A is said to be 

pEeP 

P-primitive. In [29] we proved the extension of Theorem 1: 


Theorem 6. [fn EN,n>10,P ts a set of prime numbers not exceeding n 
with 
1 
s(P) = S- = Gy, 
peP ie 
Ac {1,2,...,n} and A is P-primitive, then we have 
£(A,n) < c9(s(P))71/?. 


We showed that, apart from the values of the constants no, cg and cg, the 
theorem is best possible for all n and ?, i.e., there exist n1, cio, C11 such that 
ifn > ny, and P is a set of prime numbers not exceeding n with s(P) > cio, 
then there is a set A such that AC {1,2,...,n}, A is a P-primitive and 


L(A, n) > e1(s(P))71/?, 
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Moreover, we discussed some consequences of the theorem and some further 
related problems. The most interesting problem, that we could not settle, is 
the following one: 


Problem 38. Is it true that if k,n EN, n> ng(k), AC {1,2,...,n}, a=a’ 
(mod k) for all a,a’ € A and 


(A, n) > ci2k7!(loglogn)~!/?, 
then there exist integers a,a’ such that a,a’ € A, aa’ and ala’? 


For ACN, n€N write 


d(A,n)= So 1, 


acA,a|n 
and let 
D(A, 2) = max d(A,n). 
In [13-15] and [16], Erdés and I studied the function D(A, x). 
Further related results have been proved by Anderson [3, 4], Erdés and 


Sarkézy [12], Meijer [26, 27], Sattler [30], Anderson, Cohen and Stothers [5], 
Cohen [7], Klotz [25] and Erdés [11]. 


3. A Further Result on P-Primitive Sets 


In this section, we will extend Theorem 2 to P-primitive sets (in the same 
way as Theorem 6 extends Theorem 1 to P-primitive sets). 
Ifn €N,n > 3 and P is a set of prime numbers not exceeding n, then let 


L(P,n) = max f(A, n) 
and 
M(P,n) = max m(A,n) 


where in both cases the maximum is taken over all P-primitive sets A = 


{1,2,...,n}. In the special case when P consists of all the primes not 
exceeding n, by Theorems 2 and 3 we have 
M(P,n) = o(L(P,n)). (1) 


We will show that this also holds in the more general case when P contains 
all or “almost all” the large primes, and it will be shown that a condition of 
this type is necessary. 


Theorem 7. There are absolute constants no, C13 with the following proper- 
ties: Assume thatn EN, n> no and P is a set of primes not exceeding n. If 
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> 100 (2) 


Blr 


psn,peéP 


then let y denote the smallest positive integer y such that 
1 
log log log y > 3 =; (3) 
y<psn,peP © 


if the left-hand side of (2) is < 100, then let y = 10. Assume that A C 
{1,2,...,n} and A is A-primitive. Then we have 


t(A) < ci3 log log y. (4) 
Note that Theorem 7 is certainly superior to Theorem 6, i.e., (1) holds if 
i 1 
y 5 < (5 - :) log log login 
pSn,pEP 


(namely, in this case y in Theorem 7 satisfies loglog y < (loglogn)!/2-*/2), 
Moreover, Theorem 7 is superior to Theorem 6 in the important special case 
when P is of the form 


P={p:z<p<n,p prime}. (5) 
In this special case we obtain 


Corollary 1. There are absolute constants no, ci4 such that ifn > no, 1< 
z<n,AcC {1,2,...,n} and 


t(A) > c14 log log(z + 3) (6) 


then there area, a’ witha€ A, a’ € A, aa’, ala’ and p(a'/a) > z. 


Note that in the z = 1 special case we obtain Theorem 2. 

Corollary 1 (and thus also Theorem 7) is best possible apart from the 
value of the constants no, ci4. Indeed, if we take A = {1,2,...,[z]}, then 
clearly we have 


t(A) < cys log log(z + 3) 
and there are no a, a’ witha € A, a’ € A, aa’, ala’ and p(a/a’) > z. 
Proof of Theorem 7. Put 
Ai ={a:a<y,a€ A},A,={a:y<a,ace A} 
so that 
t(A) =t(Ai) + t(A2). (7) 


Clearly we have 
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1 
t(A1) = S- S- aloga 


2<a<y,acA aloga - 2<a<y 
= (1+ 0(1)) log logy < cig log log y. (8) 
Now we will estimate t(Az). Write P = |] p, and for m € N, m > 1, let 


pEP 
p(m) denote the smallest prime factor of m. Consider all the integers of the 
form 


ad wherea€ Az, d|P and p(d)>a or d=1. (9) 


Now we will show that these numbers are distinct. We will prove this by 
contradiction: assume that 


ad = a'd',a <a’, d|P,d'|P,p(d) > a or d=1, and p(d') > a’ ord’ =1. 


(10) 
It follows that 
d'\ad (11) 
and either d’ = 1 or p(d’) > a’ > a; in both cases we have 
(a,d’) = 1. (12) 
It follows from (11) and (12) that d’d. Then by (10) we have 
a: < iH 


whence ala’ and a = $|d, d|P so that a |P which contradicts the assumption 
that A is P-primitive. This proves that, indeed, the numbers of form (9) are 
distinct. Let 6 denote the set of these numbers. 

If b = ad € B, then the prime factors of ad do not exceed n. Thus by 
Mertens’ formula 


II (1 = =) = (1+0(1))ciz(logz)~*, (13) 
we have - 
1 +00 1 1 “4 
a(B) = py aa < U & =) = LL (1 a *) < Cig logn. (14) 


On the other hand, by (13) and the definition of y we have 


1 1 1 1 1 
2G a= 2G Ns) 
yy ad x . a d » : ae P 
p(d)>a or d=1 


“2 GIO) 0) ) 


a€Az a<p<n a<p<n,p€éP 
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ECM.) IL) ) 


a€Az a<p<n y<p<n,pEP 
1 1 1 
>an(- > eee TL O45) 
y<p<n.p¢P P acAeg © a<p<n - 


-1 
1 1 
> cig exp(— log log log y) » . II (1 — ~) 


a€Ag  a<p<n 


llogn 
> eaq(loglogy)"' SS) = 
ae Az2,2<a os log ie 


= C29 (log log y)~* (log n)t(A2) 


since 
l+a<e” forall «>0 
and 
1 ie ta 
M(+5)- 0-3) 0 G-5) 
pSu psx P pcx P 


It follows from (14) and (16) that 
t(Az) < c22 log log y. 


(16) 


(17) 


Finally, (4) follows from (7), (8) and (17), and this completes the proof 


of Theorem 7. 


Proof of Corollary 1. We apply Theorem 7 with the set P defined by (5). 
Then the right-hand side of (3) with z in place of y is 0, thus the number y 
defined in Theorem 7 satisfies either y < z or y = 10. Thus by Theorem 7, 


for a P-primitive set we have 


t(A) < c13 log log y < c13 log log max(z, 10) < c93 log log(z + 3). 


Thus choosing c14 = C23 in Corollary 1, (6) implies that A is not P-primitive 


which completes the proof of Corollary 1. 


Appendix 


The paper above appeared in 1997. Since that time, 10 related papers have 
been published; they will be listed at the end of this Appendix. I will keep 
the notations and reference numbers (from [1—30]) of the original paper while 
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the reference numbers of the papers published since 1997 will start with [31]. 
In this appendix my goal is to present a short survey of these recent papers. 

In [35] Ahlswede, Khachatrian and Sark6zy extended the study of the 
density of “large” primitive sets from the logarithmic density and “loglog 
density” to other densities defined in terms of different weight functions. 
Among others they proved that if the weight function f : N —> [0,00) is 
“smooth” in a well defined sense, then for « > 0, N > No(e, f) there is a 
primitive set A Cc {1,2,...,N} such that its density with respect to f is 
greater, than (1 — 6) pecan: and if a conjecture of Erdés is assumed then 
this lower bound is essentially best possible. 

In [34] Ahlswede, Khachatrian and Sarkozy studied “large” primitive sets 
consisting of squarefree integers. Among others, they proved that considering 
all the primitive sets A C {1,2,...,N} consisting of squarefree integers we 
have 
6 log N 
m2 (27 log log N)?/2 


1 
max b> ~ = (1+ 0(1)) 
acA 
which settles a conjecture of Pomerance and Sarkézy [29]. 

In [22] we wrote: “The following problem seems difficult: Let bj <--- be 
an infinite sequence of integers. What is the necessary and sufficient condition 
that there should exist a primitive sequence a, < --- satisfying a, < cby 
for every n?” This was one of the favorite problems of Erdés. However, in 
this original form the problem seems hopelessly difficult. Thus Ahlswede, 
Khachatrian and Sarkozy [32] studied the following milder and more realistic 
version of the problem: How fast can the counting function A(#) = |AN[1, z]| 
of an infinite primitive set grow? It follows from Erdés’s Theorem 2 above 
that for a primitive set A we must have 

x 


A 
2) s (log log x) (log log log x) 


for infinitely many « € N, and we proved in [32] that for ¢ > 0 there is an 
infinite primitive set A with 


A(a) > 


x 


(log log x) (log log log x) 1+" 


Martin and Pomerance [40] improved on this result significantly. Denote the 
k-fold logarithm of x by log, «. They proved (in a slightly simplified form): 
for any integer k > 3 and every real number ¢ > 0 there exists a primitive 
set A such that 

x 


A(x) x : 
@) log, x---log;,_1 x(log, x)!©’ 


this estimate leaves a very small gap between the best lower and upper 
bounds. 
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Porubsky [41] generalized some of the results on primitive sequences of in- 
tegers to multiplicative arithmetical semigroups G satisfying J. Knopfnacker’s 
“Axiom A”. 

For A CN let Q,, denote the set of the integers which can be represented 
as a quotient a/a’ with a,a’ € A, and let QP denote the set of the 
integers which have infinitely many representation in this form. Ahlswede, 
Khachatrian and Sarkézy [31] studied the size properties of Q.4 and Q9 
under various density assumptions on A. 

If py < po < +++ < py are primes, r < t, a = p...pp and b = 
Pi..-PrPr+1---pe, then a is said to be a prefix of b. If the set A C N is such 
that no element of it is a prefix of another then A is said to be prefix-free. 
The notion of suffix and suffix-free set is defined similarly. In [33] Ahlswede, 
Khachatrian and Sarkozy studied density properties of prefix-free and suffix- 
free sets. 

Let y1,Y2,---;Yn;--- be integers greater than 1. Then the products yy, 
Y1Y2, ---, YiY2°'' Yn, ---are called initial products. Beigbéck, Bergelson, 
Hindman and Strauss [36] proved that if A C N is “additively large” in a 
well defined sense then it contains initial products of a given length, resp. 
infinite sequence of initial products. 

In 1938 Erdés [39] estimated the maximal number of positive integers 
up to N with the property that no one of them divides the product of two 
others. Chan, Gyéri and Sarkézy [38], resp. Chan [37] extended this problem 
by studying sets of integers such that no one of them divides the product of 
k others. 
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1. Introduction 


In this paper we give a short survey of additive representation functions, in 
particular, on their regularity properties and value distribution. We prove a 
couple of new results and present many related unsolved problems. 

The study of additive representation functions is closely related to 
many other topics in mathematics: the first basic questions arose from 
Sidon’s work in harmonic analysis; analytical methods (exponential sums) 
and combinatorial methods are equally used; Erdés and Rényi introduced 
probabilistic methods, etc. 

Paul Erdés played a dominant role in the advance of this field. As 
Halberstam and Roth write in their excellent monograph [23] written on 
sequences of integers: 
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2. Notations 


The set of integers, non-negative integers, resp. positive integers is denoted 
by Z,No and N. A,B,... denote (finite or infinite) subsets of No, and their 
counting functions are denoted by A(n), B(n),... so that, e.g., 


A(n) = |{a:0<a<n,aeE A}f. 


A; + Ao +--+ +A, denotes the set of the integers that can be represented 
in the form a, +a2+---+ a,x with a, € Aj,...,a, € Ax; in particular, we 
write A+ A = 2A = S(A). For A C N, D(A) denotes the difference set 
of the set A, i.e., the set of the positive integers that can be represented in 
the form a — a’ with a,a’ € A. For A = {aj,a2,...} C No,k € N we write 
kxAz= {ka,, kaa, oe ih 


Representation Functions 


For A C No, n € No the number of solutions of the equations 


/ / 
ata=n a,a EA, 
/ / tf 
ata =n, a,a €A, a<a 
and 
/ Vs / 
at+a=n, aja EA, a<a 


is denoted by 11(A,n), r2(A,n), resp. r3(A,n) and are called the additive 
representation functions belonging to A. 

For g € N, Ba[g] denotes the class of all (finite or infinite) sets A C No 
such that for all n € No we have r2(A,n) < g, i.e., the equation 


at+a=n, a,a’ EA, a<a’ 


has at most g solutions. The sets A € Bg[1] are called Sidon sets. 
If F(n) = O(G(n)), then we write F(n) < G(n). 


3. The Representation Function of General Sequences. 
The Erd6és-Fuchs Theorem and Related Results 


Erdés and Turan [22] proved in 1941 that for an infinite set A C N, the 
representation function r;(A,n) cannot be a constant from a certain point 
on. Dirac [6] and Newman proved that the same holds with r2(A,n) in place 
of r1(A,n). Since their proof is short and elegant, we present it here: 
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Let 
f(z) = S- a” (for x real, |x| < 1). 
acA 
If ro(A,n) = k for n > m, then 


1 5 a _<S r — port 
5UMe) + FC) =D ral Ande® = Pale) + 8 


where P(x) is a polynomial of degree < m. If « + —1 from the right, then 
the right-hand side has a finite limit while the left-hand side tends to +00. 
This contradiction proves the theorem. 

Moreover, in [22] Erdés and Turan conjectured that their result can be 
sharpened in the following way: if A C N and c > 0, then 


N 
S"ri(A,n) = eN + O(1) 
n=1 
cannot hold. 
In [12], Erdés and Fuchs proved two theorems one of which sharpens the 
above mentioned result of Erdés and Turan: 


Theorem 1 (Erdés and Fuchs [12]). Jf A = {ai,a2,...} C N,c > 0, or 
c=0 and a, < Ak? (for k =1,2,...), andi =1,2,3, then 


N 


1 
lim sup — r;(A,n) —c)? > 0. 
ee a) a) 


Their other, better known result (in fact, this is the result known as 
“the Erdés-Fuchs theorem” ) proves the conjecture of Erdés and Turan in the 
following sharper form: 


Theorem 2 (Erdés and Fuchs [12]). Jf A CN,c>0, then 


N 
SS" ri(A,n) = eN + 0(N'/4(log N)~1/?) (1) 


cannot hold. 


One of the most important problems in number theory is the circle 
problem, i.e., the estimate of the number of lattice points in the circle 
x? +y? < N. Writing 

A(N) = |{(,y):2,y € Za? +y? < N}|—aN, 
the problem is to estimate A(N). By a classical result of Hardy and Landau, 


one cannot have 


A(N) = 0(N/4 (log N)*/4). (2) 
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The importance of Theorem 2 is based on the fact that the special case 
A = {1?,2?,...} of it corresponds to the circle problem, and the -estimate 
proved in the much more general Theorem 2 is only by a logarithm power 
worse than (2). 

Theorem 2 has been extended in various directions. Bateman, Kohlbecker 
and Tull [3] studied the more general problem when the left-hand side of (1) is 
approximated by an arbitrary “nice” function (instead of cN). Vaughan [40] 
extended the result to sums of k(> 2) terms (see also Hayashi [24]). 
Richert [29] proved the multiplicative analogue of Theorem 2. Sarkozy [34] 
extended Theorem 2 by giving an (-result on the number of solutions of 


atb<N, aca, be B. 


Jurkat showed (unpublished) that the factor (log N)~'/? on the right-hand 
side of (1) can be eliminated, and recently, Montgomery and Vaughan [28] 
published another proof of this result. 

Erdés and Sarkozy [14, 15] showed that if f(n) + +oo, f(n+1) > f(n) 


for n > no and f(n) = o( qoyyyz), then 
max|ri(A,n) — f(n)| = o((F(N))”?) (3) 


cannot hold (see also Vaughan [40], Hayashi [24, 25]). Erdés and the authors 
continued the study of the regularity properties of the functions r;(A,n) 
in [16, 17] and [18], first by studying the monotonicity properties of these 
functions (see also Balasubramanian [2]). In an interesting way, here the 
three representation functions r1(A, 7), re(A,n), 73(A,n) behave completely 
differently. 

We proved 


Theorem 3 (P. Erdés, A. Sarkézy, V.T. Sés [17]). 


(a) r1(A,n) can be monotone for n > no only in the trivial case when A 
contains all the positive integers from a certain point on; A(N) = N-—c 
for N > ny. 

(b) There is an infinite set A such that N— A(N) > N/3 and r3(A,n) is 
monotone increasing for n > no. 

(c) If 

N — A(N) 
Nes logN sd 


then r2(A,n) cannot be increasing from a certain point on. (See also 
Balasubramanian [2].) 


But we still do not have the answer for 


Problem 1. Does there exist an infinite set A such that N \ A is infinite 
and ro(A,n) is increasing from a certain point on? 
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As Theorem 6 below shows, it may change the nature of the problem 
completely if a “thin” set of sums can be neglected. Here we mention two 
problems of this type: 


Problem 2. Does there exist a set ACN such that N — A is infinite and 
ri(A;n +1) 2 ri(A,n) 


holds on a sequence of integers n whose density is 1? If such a set exists, then 
how “dense” can N\A be? 


Problem 3. Does there exist an arithmetic function f satisfying f(n) > co, 

f(n+1)> f(n) forn> no, and f(n) = o(togm)) and a set A such that 
[ri(A,n) — f(n)| = o((f(r))”) 

holds on a sequence of integers n whose density is 1? 


Next we studied the following problem: for which sets A Cc N is 
\r1(A,n + 1) —71(A, n)| bounded? Since we have recently given a survey [21] 
of these results, thus we do not present further details here. 

We complete this section by adding two problems that the first author of 
this paper could not settle in [34]. 


Problem 4. Is it true that if a, < a2 <--- and by < bo <--- are infinite 
sets of positive integers with 


lim SF =] 
k>+00 bp 
and c>0, then 
{(,9) sa; +b; < N}| =cN +O0(1) 
cannot hold? 


Problem 5. Is it true that if ay < ag <--: is an infinite set of positive 
integers with 
Ok+1 — Ak > a,” 


and f(n) is a “nice” function (say, its second difference f(n + 2) — 
2f(n+1)+ f(n) > 0) with 


n< f(n)<ni*, 
then 
I{(i, 9) 10 < la; — aj| < N}| = f(N) +002) 
cannot hold? 


(This would cover Dirichlet’s divisor problem in the same way as the 
Erdés-Fuchs theorem covers the circle problem.) 


238 Andras Sark6zy and Vera T. Sdés 


4. A Conjecture of Erdés and Turan and Related 
Problems and Results 


In 1941 Erdés and Turan [22] formulated the following attractive conjecture: 


Conjecture 1 (Erdés and Turan [22]). If AC N and 1ri(A,n) > 0 for 
n> no (i.e, A is an asymptotic basis of order 2), then r1(A,n) cannot be 
bounded: 
lim sup r1(A, 2) = +00. (4) 
n—+00 
This harmlessly looking conjecture proved to be extremely difficult: since 
1941 no serious advance has been made. Erdés and Turan formulated an even 
stronger conjecture: 


Conjecture 2 (Erdés and Turan [22]). [fai < a2 < -:- is an infinite 
sequence of positive integers such that for some c>0 and allk © N we have 
ax < ck?, then (4) holds. 


Erdés and Fuchs [12] remarked that having the same assumptions as in 
Conjecture 2, the mean square of 71(A,n) can be bounded: there are ac > 0 
and an infinite set A C N such that a, < ck? for all k € N and 


N 
1 
lim sup — r2(A,n) | < +00. 5 
inane (3 i ) (5) 


Answering a question of Erdés, Ruzsa has proved recently the analogous 
result in connection with Conjecture 1: 


Theorem 4 (Ruzsa, [32]). There is an infinite set A C N such that 
ri(A,n) > 0 for alln > no and (5) holds. 


If Conjecture 1 is true, then assuming that A C N, A is infinite and 
r2(A,n) is bounded, the function r2(A,n) must assume the value 0 infinitely 
often. Erdés and Freud [11] conjectured that having the same assumptions, 
ro(A,n) must assume also the value 1 infinitely often, i.e., there are infinitely 
many integers n € S(A) whose representation in the form 


at+a=n, aa €éJ, a<a’ (6) 


is unique. This attractive conjecture seems to be true although probably it 
is very difficult. Moreover, they write “Probably there are “more” integers n 
with a unique representation of the form (6) than integers n with more than 
one representation.” We will show that this is not so; at least for A € B(q), 
g= 3. 


Theorem 5. For every g € N, g > 2 there is an infinite set A C No such 
that A € Bz[g] and fore > 0, n> no we have 
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2 
2g—3 
Proof. Let € = {e1, e2,...} be an infinite Sidon set, and define A by 

A=2gx €4+{0,1,...,g—1}. 
We will show that this set A has the desired properties: 


(i) A€ Bolg}, 
(ii) A satisfies (7). 


If r2(A,n) > 1 for some n EN, ie., n € S(A), then, by the construction 
of the set A, n can be represented in the form 


l{n:n< N,re(A,n) = 13] < (1+e) l{n:n< N,re(A,n) > 1}. (7) 


(2ge +7) + (2ge’ +7) =2g(e+e’)+(it+j)=n (8) 

where 
e,e EE, (9) 
2ge+i<2ge'+), (11) 


and r2(A,n) is equal to the number of integers e,e’,i,7 satisfying (8), (9), 
(10) and (11). It follows from (10) and (11) that 


e<e’ (12) 
and 
O<it+j< 29-2. (13) 
Define the integers u,v by 
n=2gut+u, 0O<v< 2g. (14) 
Then it follows from (8), (13) and (14) that 
e+e’ =u (15) 
and 
itj=v (16) 


(where v < 2g — 2). Since € is a Sidon set, (12) and (15) determine e and 
e’ uniquely. Thus r2(A,n) is equal to the number of pairs (7,7) satisfying 
(10), (11) and (16). If e < e’, then (11) holds automatically, and the number 
of solutions of (10) and (11) is vu+1 for v < g—1 and 2g — v — 1 for 
g—1<v< 2g -—2. Denote the set of the integers n that can be represented 
in the form 


n=2g(e+e') +i (where e < e’, e,e’ EE) (17) 
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with 7 = 0 or 2g — 2 by K, and let £ denote the set of the integers n of 
form (17) with 1 < i < 2g—3. Then it follows from the discussion above that 


=lfornekK 
ra(A,n) { Sen ue) 
and clearly we have 
2 


Finally, if ro(A,n) > 1 and n ¢ KUC, then n can be represented in the form 
n=2ge+v withe€€, O<u<2g-2; 
let M denote the set of the integers n of this form. Clearly, 
M(n) = 0(K(n)). (20) 
It follows from (18), (19), (20) and 
{n:néEN,re(A,n) > 1} =KULUM 


that 
= 
2g—3 


which completes the proof of the theorem. 


l{n:n < N,ro(A,n) = 1}| = (14+ o(1)) l{n:n< N,re(A,n) > 1} 


By Theorem 5, it is not true that if r2(A,n) is bounded, then 
r2(A,n) =1 (21) 
holds more often than 
ro(A,n) > 1. 


On the other hand, we think that (21) must hold for a positive percentage of 
the elements of S(A): 


Problem 6. Show that if A CN is an infinite set such that ro(A,n) is 
bounded, then we have 


l{n;n < N,r2(A,n) = 1}| 
| 
oo S(A,N) 


Note that it could be shown that the limsup in (22) cannot be replaced 
by lim inf. 

Moreover, if (22) is true, then for sets A € Bo[g] one might like to give a 
lower bound in terms of g for the lim sup in (22). Perhaps Theorem 5 is close 
to the truth so that this lim sup is > - The special case g = 2 seems to be 
the most interesting and, perhaps, in this case there is a good chance for a 
reasonable lower bound: 


> 0. (22) 
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Problem 7. Assuming, that A C N is an infinite set with A € B2[2], i.e., 
ro(A,n) < 2 for all n, give a lower bound for 


lim su Jin ins N,ro(A,n) = 1 
mites {nin <N,r2(A,n) = 2h] 


By Theorem 5, this limsup can be < 2; is it true, that it is always > 2? 


By our conjecture formulated in Problem 6, the assumption 
r2(A, n) = O(1) (23) 


implies that r2(A,n) = 1 must hold for a positive percentage of the elements 
of S(A). First we thought that (23) can be replaced by the weaker condition 
that r2(A,n) is bounded apart from a “thin” set of integers n and still the 
same conclusion holds. Now we will show that this is not so and, indeed, for 
every finite set U C N there is a set A such that, apart from a “thin” set of 
integers n, r2(A,n) assumes only the prescribed values u € U with about the 
same frequency. 
For AC No, u € N denote the set of the integers n € N with 


ro(A,n) =u 
by S,(A) so that S(A) = UF25 Su(A). 


Theorem 6. Let k € N and let uy < ug < ... < ux be positive integers. 
Then there is an infinite set AC No such that writing 


B=N\ (Us.a) 


we have 
5,,(A,N) = 7 + OW®) 
and 
B(N) = O(N®) 
where a = 284. 


(Here S,,,(A,.N) denotes the counting function of S,,,(A).) 
Thus, e.g., there is a set A such that ro(A,n) = 2 for all but O(N%) 
values of n with n < N. 


Proof. The proof will be based on the following lemma: 


Lemma 1. Let F andG denote the set of the non-negative integers that can 
be represented in the form baer 642°", resp. peer 6,274! where e; =0 orl 
for alli, and write H = FUG. Then 
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(i) Every n € N has a unique representation in the form 
f+tg=n, fEF, Gg EG; 


(ti) S(F,N) = O(N); 
(wii) S(G,N) =O); 
(iv) We have 


|{n:n€N,ro(H,n) > 1}| = O(N®). 


Proof. (i) is trivial. 
(ii) follows from the fact that if n € S(F), then representing n in the form 
n =>) €i4' where €; = 0,1,2 or 3, we have 0 < ej < 2 for all i, ie., 
the digit 3 is missing. 
(iii) follows from (ii) and G = 2 x F. 
Finally, (iv) follows from (i), (ii) and (iii), and this completes the proof 
of the lemma. 


Now we will construct a set A of the desired properties. Denote the 
elements of the set G (defined in Lemma 1) by (0 =)g1 < go < -:-, write 
Gi = {91,92,--+;Gu;} and L; = k x (F+G;) + {i} for i = 1,2,...k, and 


finally, let A = (Uh, £;) Ul(k x G). Clearly, it suffices to show that 
(i) If i € {1,2,...,k}, n € N, n = i (mod k) and n is large enough 
(depending on u;), then n has exactly u; representations as the sum 
of an element of Shar £; and an element of k x G; 
(ii) For 1 <i<j<k we have 
{nin <N, ne £;+L;}| = OW); 
(iii) S(k x G,N) = O(N®%). 
To prove (i), define m by n = km +i, and consider a representation of n 
in the desired form: 


k 
l+kg=n=khm+i, lelL, geg, (24) 


j=l 


By the definition of the sets £;, we have @ € £,; if and only if 


l=k(f+o)+3 (25) 
for some f € F,1<t < uj. It follows from (24) and (25) that 
Kft+toet+g)+7=km+i. (26) 


By 1 < i,j < k, this implies that ¢ = 7. Thus (26) can be written in the 
equivalent form 


f+g=mMm— 4. 
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By (i) in Lemma 1, for m > gy, and each of t = 1,2,...,u;, this equation 
has exactly one solution in f and g. Again by (i) in Lemma 1, these u; pairs 
(f,g) determine distinct solutions (¢,g) of (24). 

To complete the proof of (i), it remains to show that distinct pairs (£, 9), 
(’, 9’) satisfying (24) (also with (¢’, 9’) in place of (¢,g)) determine distinct 
representations of n if n is large enough, i-e., if 


Ctkg=n=l' +k’ (27) 


and n is large, then ¢ 4 kg’, ¢’ 4 kg. Indeed, assume that contrary to this 
statement we have 


l=kg', U=kg. (28) 
Then by (27) and (28), + ¢’ =n. Hence 
L>n/2 (29) 
or ' > n/2; we may assume that (29) holds. By (25) and (28) we have 
l=k(f +g) +5 = kg’. (30) 
By 1<j <k, it follows that 7 = k. Thus (30) implies 
ftoatl=q’. (31) 
By (25) and (29), we have 
f — +00 as n —> +00. (32) 
It is easy to see that 
_ lim ePasee, lf —g| = +00. (33) 


By (32) and (33), (31) cannot hold for t < u, and large n. This contradiction 
completes the proof of (i). 
To prove (ii), observe that n € £; +L; implies that 


n€kx(F4+G;)+{i} +kx(F+G;)+ {7} =kxS(F)+ {i+ gp} $k x Gi +k xG;). 


Here {i+ j} +kxG;+k~xG; is a finite set (in fact, it has at most uz elements). 
Thus (ii) follows from Lemma 1 (ii). 
Finally, by S(k x G) =k x S(GQ), (iii) follows from Lemma 1 (ii). 


Remark 1. Let 7; € Qt, 1 <i<k with em r; = 1. Using the same idea 
as in the proof of Theorem 6 we can prove the existence of an infinite set 
A C No for which 


Su,(A, N) =73N + O(N®) iegsd 


with some 0 < a < 1. It seems likely that an analogous theorem holds with 
arbitrary given densities ;,1 <i <k, in place of r;. If so, the proof will be 
more involved. 
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5. Sidon Sets: The Erdds-Turan Theorem, Related 
Problems and Results 


In 1932 Sidon [36] in connection with his work in Fourier-analysis considered 
co co 

power series of type }> 2% when ( > za)" is of bounded coefficients. This 
i=l i=l 

led to the investigation of finite and infinite sequences (a;) with the property 

that for g fixed the number of solutions 


di, +++ +04, =n 


is bounded by g forn € N. 

Sidon sequences correspond to the case h = 2 and g = 1, ie. r2(A,n) < 1. 
Recall that for g € N, Bo(g) denotes the class of all (finite or infinite) sets 
A CNo such that for all n € N we have ro(A,n) < g. 

Some specific lines of investigations are the following: 


(a) For A € Bo(g) and AC [1,..., N] how large can |A| be? In the infinite 
case how fast can the counting function A(n) grow? 

(b) What can we say about the structure of A resp. A+ A if |.A] resp. A(n) 
is large? 


There is an excellent account on this subject in Halberstam-Roth [23] and 
also a recent survey Erdés-Freud [11]. 

While there are many results on Sidon sets, much less is known on sets 
A € Bgl[g]. In particular, let F'(.NV,g) denote the cardinality of the largest set 
A € Bglg] selected from {1,2,...,N}. Chowla [5], Erdéds [7] and Erdés and 
Turan [22] gave quite sharp estimates for the cardinality of the largest Sidon 
set selected from {1,2,...,N}: 


Nie — O(N) 2 FN A) < NY + OW"). (34) 


On the other hand, very little is known on F'(N,g) for g > 1. Clearly we 
have 


F(N,g) > F(N,1) (- (i+ o(1))N/?) 


for all g € N. Erdés and Freud [11] showed that F(N,2) > 2!/?N1/?. On the 
other hand, a trivial counting argument gives 


F(N,g) < 2g'/?N¥V?, 
Problem 8. Show that for all g € N the limit limy_.4.. F(N,g)N7\/? 


exists, and determine the value of this limit. In particular, estimate F(N, 2). 


Further, very little is known on sets A € Bg[g] and their Sidon subsets. 
Erdés, resp. Ruzsa (see [7]) studied the size of Sidon sets selected from given 
sets A € Bo[g]. 

A related problem is the following: 
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Problem 9. Js it true that for g > 2, every Sidon set selected from 
{1,2,...,N} can be embedded into a much greater set A € Bgalg] selected 
from {1,2,...,N}? 


In other words, if A Cc {1,2,...,N} is a Sidon set, then let H(A, N,g) 
denote the cardinality of the greatest set E such that E € Bolg], E Cc 
{1,2,...,N} and A c €. Is it true that writing K(N,g) = min(H(A, N, g) -— 
|A|), where the minimum is taken over all Sidon sets A selected from 
{1,2,...,N}, we have 


lim K(N,2) = +00? 
N-+00 


How fast does the function K(N,g) grow in terms of N? Is it true that 
lim (K(N,g+1)-— K(N,g)) = +co for all gE N? 


N->+00 


A Sidon set Ac {1,2,...,N} is said to be mazimal if there is no integer 
b such that b € {1,2,...,N}, 6 ¢ A and AU {0} is a Sidon set. (Note that 
very little is known on the cardinality of maximal Sidon sets; see Problem 15 
n [15].) Another problem closely related to Problem 9: 


Problem 10. Does there exist a maximal Sidon set such that it can be 
embedded into a much larger set E € Bolg]? 

In other words, let L(N,g) = max(H(A, N,g)—|A|) where H(A, N, 4g) is 
the function defined in Problem 9 and the maximum is taken over all maximal 
Sidon sets selected from {1,2,...,.N}. Is it true that 


lim L(N,2) = +00? 


N-+0o 


Is it true that 
lim (L(N,g +1) — L(N,g)) = +00 


N--+00 


for all g € N? 
As Sect. 4 also shows, it may change the nature of the problem completely 
if a “thin” set of sums can be neglected. Several problems of this type are: 


Problem 11. How large set A can be selected from {1,2,...,N} so that tt 
is an “almost Sidon set” in the sense that 


Is] = (1+ 0c)? (35) 


It follows from a construction of Erdés and Freud [11] that there is a set 
A such that Ac {1,2,...,N}, (35) holds and 


|A| > (= + o(t)) Nie, (36) 


so that |A| can be much greater than F(N,1) = (1 + 0(1))N1/? (see (34)). 
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In the infinite case much less is known than in the finite case. Beyond 
what follows from (34), Erdés proved 


Theorem 7 (Stéhr [38]). There is an absolute constant c > 0, such that 
for every (infinite) Sidon sequence A 


A(n) < c(n/logn)!/? 
holds infinitely often. 


On the other hand, Kriickeberg, improving a result of Erdds, proved in 
1961 


Theorem 8. There is an (infinite) Sidon sequence A such that 


A(n) > ae 


holds infinitely often. 


It is not known whether or not the factor 1//2 is best possible. The 
greedy algorithm gives the existence of an (infinite) Sidon sequence for which 


A(n) > n'/3 for all n. 


Ajtai, Komlés and Szemerédi improved this [1]: There is a Sidon sequence A 
such that 


A(n) > c(nlogn)'/3 for all n > no. 


Weak Sidon Sets 


We considered Sidon sets defined by 

r2(A,n) <1 (37) 
which means that we require 

atyAutv (38) 


for any x,y,u,v € A of which at least three are different. 

In connection with some particular problems it is more appropriate to 
consider Sidon sets where we require (38) only for z,y,u,v € A where all 
four are distinct. (So we may have an arithmetic progression of length three, 
a solution of x + y = 2u.) 

If 


r3(A;n) <1 (39) 


holds, A is called a weak Sidon set. 
It is easy to see that the maximum size of Sidon set resp. of a weak Sidon 
set in [1, N] are asymptotically the same. 
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A problem of Erdés on the distribution of distances in the plane led us 
to formulate the following question: 

Let A* be a weak Sidon set. How large Sidon set A must be contained 
by A*? 

Another formulation of the problem is: 

Suppose that for A* c [1,N] any four distinct a;,,@;,,@j,,@;, € A* 
determine at least five distinct differences: 


[{lai, —ai,|, 1<u<p<4}[ 25. 


Let h(A*) denote the cardinality of the largest Sidon set AC A*. 
Let 


f(m) = | ae h(A*) 


If A* is a weak Sidon set, then for each a € A* there is at most one pair 
b,c € A* such that 6+ c = 2a. This implies that 


f(m) = 


Gyarfds and Lehel [27] proved that with some absolute constant 6 > 


Mm, 


ND] re 


ew 
100 


(5 +5) ms fm) < 2m41. 


Problem 12. Prove that lim £(m) exists and determine the limit. 
m—- oo 
It is very probable that a dense weak Sidon set contains a Sidon set of 
almost the same size: 


Problem 13. Suppose A* C [1,N] and m= |A*| > eN'/?. Is it true that 
h(A*) > d(e)NV/? 
where 6 > 1 ife +1? 


Remark 2. The problem of Sidon sets resp. weak Sidon sets is related to 
anti-Ramsey-type problems. 

Consider the complete graph Ky with vertex set V(Kw) = {1,...,N} 
and an edge-coloring y : [V]? + V where y(a, b) = |a—b|. A Sidon set A C V 
is the vertex set of a so-called totally multicolored complete subgraph (where 
all the edges have different colors). 

A weak Sidon set A* C V corresponds to the vertex set of a complete 
subgraph where independent edges have different colors. 


6. Difference-Sets 


Above we considered mostly sums a + a’. One might like to study the 
analogues of some of these problems with differences a — a’ in place of sums 
ata’. 
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Problem 14. In [19] and [20] we studied the structure of the sum set S(A) 
of Sidon sets A. What can be said about the structure of the difference set 
D(A) of Sidon sets A; in particular, 


(a) What can be said about the number and length of blocks of consecutive 
integers in D(A), 

(b) About the size of the gaps between the consecutive elements of D(A), 
etc.? 


Another closely related problem: 
In [19] we studied the solvability of the equation 


D(A) =B 


for fixed sets B C N and, in particular, we gave a quite general sufficient 
condition for the solvability of this equation and in fact we showed that 
under quite general circumstances, not only the elements of the difference set 
D(A), but also the number of solutions of 


a—a' =b, aa eA 


(for all b € B) can be prescribed. The nature of the problem completely 
changes if we restrict ourselves to Sidon sets A. 


Problem 15. Find possibly general conditions such that for sets B C N 
satisfying these conditions, there is a Sidon set A whose difference set is the 
given set B. 


One might like to see what is the connection between the behavior of 
sums and differences (see Ruzsa [33] for a related result): 


Problem 16. Consider finite sets A such that 
a—a’=d, a,aEe€A 


has at most two solutions for alld € N What can be said about the size of 
the Sidon sets, resp. sets A € B[2] selected from such a set A? 


Problem 17. Do there exist numbers 6 > 0, No such that for N > No there 
is a set AC {1,2,...,N} for which 


Aleit an 
and both 
a—-a=d, aadcéEA 
at+tad=n, aadcéA, axa’ 


have at most two solutions for alld EN, n€ N? 


On Additive Representative Functions 249 


7. Generalizations 


So far we have studied sums a+a’ and differences a—a’. Already in these two 
cases the difficulty of problems and the results can be completely different. 
It is even more so if we consider the linear form ca + c’a’, or more generally 
f(ai,...,@x) = cra, +-+++ crag where c; € Z for 1 <i < k and the c;’s are 
fixed. 

This is indicated already by the following simple but important example. 


Example of Ruzsa. Let A= {a (a= >) 62".e;=0 or i}. Then for n € N 
i=0 
the number of solutions 
a+2a'’=n, a,a’ € A, 


is 1 for anyn EN. 

This shows that the behavior of the representation functions depends 
very much on the coefficients of the linear form. Here we formulate only a 
few questions by extending the problems we discussed above. 


Problem 18. For which (c1,...,cx) can the representation-function 
R(A, C1,-++5,Ck; n), 


counting the number of solutions of cya, +++: + cpap =n (a1,..-,@% € A), 
be constant for n > No? 


Problem 19. For which (c1,...,cK) ts there an Erdés-Fuchs-type result, 
analogous to Theorems 1 and 2? 


Problem 20. For which linear forms is there an Erdés-Sdrkézy [15]-type 
result, when Ry(A,c1,..-,Ck;m) cannot be too close to a “nice” function? 


Problem 21. When and how the results on the monotonicity of ri(A;n) 
(see Theorem 8) can be extended to the linear form cya + +++ + Crag ? 


One may generalize these problems even further by studying polynomials 
f (a1, @2,...,@%). In particular, very little is known on products aa’. Erdés [10] 
estimated the cardinality of sets A such that Ac {1,2,...,N} and all the 
products aa’ with a,a’ € A, a <a’ are distinct. Moreover he [9] studied the 
multiplicative analogue of the Erdds-Turdn conjecture mentioned in Sect. 2. 
Three further problems involving products are: 


Problem 22. For ACN, neEN, let s(A,n) denote the number of solutions 
of the equation 


/ / 
aa=n, aaéeA, axa. 


Characterize the regularity properties of this function s(A,n) analogously 
as in the papers [14-18] where we discussed the additive analogue of this 
problem by studying r1(A,n), ro(A,n), r3(A,n). In particular, how well can 
one approximate s(A,n) by a “nice” arithmetic function f(n) ? 
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Problem 23. Find a multiplicative analogue of the conjecture of Erdés and 
Freud mentioned in Sect. 2 and, perhaps, this can be attacked more easily. In 
other words, is it true that if A CN is an infinite set such that the function 
s(A,n) defined in Problem 15 is bounded, then s(A,n) = 1 for infinitely many 
values of n? 

Problem 24. Roth [30, 31], Heath-Brown [26], Szemerédi [89] and others 
estimated the cardinality of sets A C {1,2,...,N} not containing three- 
term arithmetic progressions. Find a multiplicative analogue of this problem: 
estimate the cardinality of the largest set A C {1,2,...,N} not containing 
three term geometric progressions, 1.€., 


2 
a1a2 = a3, a1,42,a3 € A 


implies that a, = ag = a3. (Note that the square-free integers not exceeding 
N form a set A of this property.) 
Ramsey-Type Problems 


Many of the problems discussed above can be formulated in the following 
way: if A is a “dense” set of integers, then an equation of the form 


f (a2, d2,...,a,) =0 (40) 
can be solved with a1,a2,...,a, € A. There are several important results of 
the type where instead of considering solutions a1, ag,...,a,% belonging to a 


“dense” set A, we assume that a partition 
L 
N=(JA® (A@nNA® =O for 1<i<j<d (41) 
i=l 


of N is given, and then we are looking for “monochromatic” solutions of (40), 
ie., for solutions aj, @2,...,a@, such that all these a’s belong to the same set 
A: a result of this type can be called a Ramsey-type theorem. In particular, 
Schur [35] resp. van der Waerden [41] proved that the equation 


ay + a2 = a3, 
resp. 
a, + a2 = 2az, ay, # ag 


has a monochromatic solution for every partition (41) of N. (Indeed, van 
der Waerden proved the more general theorem that for every k € N and 
every partition (41), there is a monochromatic arithmetic progression of k 
distinct terms.) It follows from these results that for every partition (41) 
both equations 


ajag = a3 
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and 
ee) 
a1a2 = 43, ay # ag 


have monochromatic solutions. (Indeed, in both cases there is a solution of 
the form a; = 21, az = 2°, a3 = 2°3.) 


Problem 25. Characterize the polynomials f(a1,a2,...,a%) such that the 
Eq. (40) has a monochromatic solution for every partition of form (41) 
or, at least, find further polynomials f(a1,@2,...,@%) with this property. In 
particular, does there exist an integer m > 2 such that the equation 


2 2 2 _ 2 
ay + ag +++ + ay = On41 


has a monochromatic solution for every partition (41) ? 


8. Probabilistic Methods. The Theorems of Erdés 
and Rényi 


In [36] Sidon asked the following question: Does there exist an A C N such 
that 7(A,n) > 1 for all n > no, and r1(A,n) = O(n)? In 1956 Erdés gave 
an affirmative answer in the following sharper form: 


Theorem 9 (Erddés [8]). There is an infinite set A CN such that 
clogn <1ri(A,n) <cglogn forn>no. 


Erd6s proved this by a probabilistic argument. In fact, he proved that 
there are “many” sets A C N with this property. 

In 1960 Erdés and Rényi published an important paper in which, by using 
probabilistic methods, they proved several results on additive representation 
functions. The most interesting result is, perhaps, the following theorem: 


Theorem 10 (Erdés and Rényi, [13]). For alle > 0, there is a X = X(e) 
such that there is an infinite Ba[A] set ACN with 


A(n) > n¥/?-© for n > no(e). 


Note that for a Bg[\] set A we have A(n) = O)(n'/?). Thus Theorem 10 
provides a quite sharp answer to Sidon’s first question, mentioned in Sect. 5. 

Remarkably enough, this paper of Erdés and Rényi appeared in the same 
year as their paper written “On the evolution of random graphs” which had 
tremendous influence on graph theory and led to one of the most extensively 
investigated and comprehensive theories in graph theory. (See Bollobds [4].) 
On the other hand, the paper [13] was nearly unnoticed for about three 
decades. 

The paper [13] of Erdés and Rényi was somewhat sketchy. In their 
monograph [15] Halberstam and Roth worked out the details. In [16], Erdés 
and Sarkézy extended Theorem 9 by showing that if f(n) is a “nice” 


252 Andras Sark6zy and Vera T. Sdés 


function (e.g., combination of the functions n%, (logn)®, (loglogn)7) with 
f(n) > logn, then there is an AC N such that 


[ri(A,n) — f(r) « (f(r) logn)'”?. 


(Compare this with their result [14] on Eq. (3).) 

The really intensive work in this field started only about 2-3 years ago. 
Erdos, Nathanson, Ruzsa, Spencer and Tetali have proved several remarkable 
results. Since their papers have not appeared yet, some of them have not even 
been written up yet, it would be too early to survey their work here. 


Remark 3. Many of the problems in additive number theory are or can be 
formulated for arbitrary groups, semigroups or for some specified structures, 
like for set systems. (An independent source for Sidon-type problems is for 
example coding theory.) We refer to a survey of V.T. Sdés [37] on this subject. 


Appendix 
Al. Introduction 


The paper above appeared in 1997. Since that time more than 100 papers 
have been published on related problems. In this Appendix our goal is to 
give a short survey of these papers. In order to limit the extent of it we will 
focus on the most important results, and in the reference list we will present 
only the records of the most important and most recent papers, and a few 
survey papers; the references to the further related work can be found in 
these papers. 


A2. Notations 


We will keep the notations and the reference numbers of the original paper; 
thus, e.g., Problem 2 will refer to the second problem in Sect. 3 of the original 
paper. On the other hand, we will refer to the sections and references in the 
Appendix by using a prefix A so that, e.g., the second item in the reference 
list of the Appendix is marked as [43]. 


A3. The Representation Function of General Sequences. The 
Erdés-Fuchs Theorem and Related Results 


Sarkézy [88] proved the following local version of Theorem 1 of Erdés and 
Fuchs: for all C > 0 there are No = No(C) and C; = C1(C) so that if ACN 
and N > No, then there is an M with 


M 
N<M<N? and )\(R(n)-C)? > CM. 


n=1 
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He also showed that this result is best possible: for all ¢ > 0 there isan ACN 
such that for infinitely many N we have 


M 
S(R(n) — 2)? < eM for all N<M< aa 


n=1 
Ruzsa [81] proved a “converse” of the Erdés-Fuchs theorem (Theorem 2) 


by showing that there exists a non-decreasing sequence A of nonnegative 
integers such that 


N 
dnl (A,n) =cN + O(N'4 log N) 


for some constant c > 0. 

Tang [93] sharpened Vaughan’s result [40] on the extension of the Erdés- 
Fuchs theorem to k term sums, and later Chen and Tang [46] estimated the 
constant implied by the ordo notation. 

Horvath [68] extended the Erdés-Fuchs theorem further by considering 
the sum A; + Ap +--+ Ax of different sets A,,A2,...,Ax, and later in 
another paper [64] he sane this result. 

Let A= {a; < ag <---} be an infinite sequence of nonnegative integers, 
and write 


R(A, a; k) = |{(@i,---,4%,) € A® 2 Gi, Hes + 1, < z}| 
and 
P(A,a;k) = R(A,2;k) — cx. 


Chen and Tang [45] estimated the mean square of this discrepancy P(A, x; k). 

Lev and Sarkézy [74] proved an Erdés-Fuchs-type theorem for finite 
groups, and they showed that their result is sharp. 

Horvath [65] proved the following theorem which is closely related to the 
first theorem of Erdés and Fuchs (Theorem 1): If A = {aj,a2,...} (a1 < 
a2 <---) is an infinite sequence of nonnegative integers and d is a positive 
integer then there is no integer no such that for all n > np we have 


d<r3(A,n) <d+ [vad 3]. 


Sandor [86] proved a similar theorem, and Chen and Tang [49] extended 
Horvath’s theorem to & term sums and the k term analogues of the other two 
functions ry and ro. 

In our original paper we mentioned the results of Erdés and Sarkozy 
[14, 15] that if the function f(n) satisfies certain assumptions, then (3) cannot 
hold, and that this theorem is nearly sharp. Horvath [66] extended the first 
result to k term sums in place of r;(A,n), and Kiss [71] proved that Horvath’s 
result is nearly best possible. 
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In [16] Erdés, Sark6zy and T. Sds proved that if A is an infinite set of 
positive integers, and, denoting the number of blocks formed by consecutive 
integers in A up to N by B(A, N), we have 

fog, “PGND hese 

N--+00 N1/2 : 
then the differences |r)(A,n + 1) —11(A,n)| cannot be bounded. They also 
showed that this result is best possible. Kiss extended the theorem to kth 
differences |A;,(R(n))|, and later he also showed [69] that his result is sharp. 

In a recent paper Sarkdzy [89] studied the analogues in Z/mZ of the 
problems considered in [16]. 

The results of Erdés, Sdérk6zy and T. Sds [17, 18], resp. Balasubrama- 
nian [2] on the monotonicity properties of additive representation functions 
have been extended by Tang [94], Chen and Tang [47, 48], resp. Chen, 
Sark6zy, T. Sés and Tang [50] in various directions. In particular, it is proved 
in [48] and [50] that if A is an infinite set of positive integers such that its 
complement B = N\A satisfies certain simple conditions then r2(A,n) cannot 
be ultimately increasing. However, Problem 1 is still open in its original form. 

S. Giri settled the first half of Problem 2 by constructing a set A of the 
desired properties (unpublished yet). It might be interesting to study the 
second half of the problem as well: how dense can N \ A be for such a set A? 

Problems 3-5 are still open. 


A4. A Conjecture of Erdés and Turan and Related Problems 
and Results 


Grekos, Haddad, Helou and Pihko [60] proved that if A is a set of nonnegative 
integers such that 


ri(A,n) > 1 (A4.1) 


for every n € N then we have r;(A,n) > 5 for infinitely many n, and Borwein, 
Choi and Chu improved this to r1(A,n) > 7. 

Konstantoulas [72] proved that if there is a number no such that if (A4.1) 
holds for n > m9 then we have 7(A,n) > 5 for infinitely many n. 

By Ruzsa’s Theorem 4 there exists an asymptotic basis A of order 2 such 
that for N > No we have 


1/7 
x( Ln) <e 
n=1 

for some absolute constant C. In two papers Tang [92] presented explicit 
values for these constants No, C. 

For m € N let R,, denote the least integer such that there is a set A C 
Z/mZ with A+ A = Z/mZ and |{(a,b):a+b=n, a,b € A}| < Rm for 
all n € Z/m2Z. It follows from Ruzsa’s result above that R,, is bounded. 
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Chen [44] proved the uniform bound R,, < 288, and Chen and Tang gave 
better bound for certain m values of special form. 

Konyagin and Lev [73] studied and settled the Erdés-Turan problem in 
infinite Abelian groups. They determined what are the infinite Abelian groups 
G for which the analogue of the Erdés-Turan conjecture holds and what are 
the ones for which it fails, and in both cases they provide further information 
on the number of representations of the elements g of G in the form a+a’ = g 
with a,a’ belonging to a basis A of G. 

(See also a paper of Haddad and Helou [62].) 

In Sect. 4 we mentioned the conjecture of Erdés and Freud that if A Cc N 
is infinite and r2(A,n) is bounded then there are infinitely many n with 


r2(A,n) = 1, (A4.2) 
and probably there are more integers n satisfying (A4.2) than integers n with 
re(A,n) > 1. 


Our Theorem 5 above disproved this second stronger version of the conjecture 
of Erdés and Freud. Sandor [87] also disproved the weaker version of the 
conjecture by constructing an infinite set A of nonnegative integers for 
which r2(A,n) < 3 for all n and it assumes only the values 0, 2 and 3 
infinitely many times. Sdndor’s construction also disproves the conjecture 
formulated in our Problem 6 but it does not settle Problem 7. Moreover, 
in Sdndor’s construction the counting function A(n) of A grows slowly: 
A(n) = O((log n)?). Thus it remains to see whether there exists a set A such 
that A(n) >> n° for some c > 0 and all n, r2(A,n) is bounded, and (A4.2) 
has only finitely many solutions. 


A5. Sidon Sets: The Erd6és-Turdn Theorem, Related Problems 
and Results 


This has been a very intensively studied field in the last 15 years. Since the 
extent of this Appendix is limited thus we have to restrict ourselves to listing 
some of the most important papers written on this subject. If the reader 
wants to know more on the papers written on Sidon sets, then O’Bryant’s 
excellent survey paper [77] can be used, while for more information on large 
B,|g] sets one should consult the paper of Cilleruelo, Ruzsa and Vinuesa [51]. 

In our original paper we mentioned the result of Ajtai, Komlés and 
Szemerédi [1] on dense infinite Sidon sets: they proved that there is an 
infinite Sidon set A with A(n) >> (nlogn)'/?. Ruzsa [83] improved on 
this significantly by proving that there is an infinite Sidon set A with 
A(n) = nV2-140(1) 

Ruzsa [84] showed that there is a maximal Sidon set AC {1,2,...,N} 
with |A| < (N log N)¥/3, 
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Erdés, Sérkézy and T. Sés [19, 21] asked whether there is a Sidon set 
which is also an asymptotic basis of order 3. Deshouillers and Plagne [54] 
proved in this direction that there is a Sidon set which is also an asymptotic 
basis of order 7, and Kiss [70] improved on this result by showing that there 
is a Sidon set which is also an asymptotic basis of order 5. 

Answering a question of Sarkozy, Ruzsa [82] showed that there is a set 
A c {1,2,...,n} with |A] > (4 +0(1)) n'/? which is both additive and 
multiplicative Sidon set. 

Improving on a result of Erdés, Sarkézy and T. Sdés [19, 20], Spencer and 
Tetali [91] showed that there exists an infinite Sidon set A such that any two 
consecutive elements s; and s;41 of the sum set A+ A satisfy s;41 — 5; < 
Ga log s; (for i = 1,2,...) where C is an absolute constant. 

As far as we know Problems 8-12 are still open. 

In our original paper we mentioned the Erdés-Turan estimate (34) for 
the cardinality F(.N,1) of the largest Sidon set selected from {1,2,...,N}. 
By (34) we have F(N, 1) = N1/? + O(.N5/18). We remark that Babai and T. 
Sds [42] generalized the notion of Sidon set to groups and they studied the 
size of Sidon sets in groups. Among others, they proved that any finite group 
G has a Sidon subset of cardinality greater than c\G|!/3. This seems to be 
quite far from being best possible, however, as far as we know it has not been 
sharpened yet. 


A6. Difference-Sets 


Some recent results and problems on the connection of sum sets and 
difference sets are discussed in the survey and problem papers by Martin 
and O’Bryant [75], Nathanson [76], Ruzsa [85] and Gyarmati, Hennecart and 
Ruzsa [61]. 

We do not know about any papers related to Problems 14-17. 


A7. Generalizations 


Horvath [67] proved partial results related to Problem 18; however, the 
problem is far from being settled. 

On the other hand, we do not know about any papers related to 
Problems 19-24. In the case of the additive problems the reason of this is 
probably that the tools used in the special case of sums a, +---+a, fail when 
one tries to extend them to the general case cja, + ---+ crax. In the case of 
the multiplicative problems there does not seem to exist such a barrier, and 
one would expect that there is a better chance to achieve nontrivial results. 


Ramsey-Type Problems 
The problems of this type are getting quite popular. 
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Erdés, Sdrkézy and T. Sds [59] proved that for any k € N and any 
k-colouring of N, almost all the even numbers have a monochromatic 
representation in the form a+ a’ with a # a’. (This settled a conjecture 
of Roth.) In a recent paper Borbély [43] extended this result in various 
directions. (In another paper Erdés and Sarkozy [56] also studied the 
multiplicative analogue of the problem in [59].) 

Shkredov [90] proved both density results on the solvability of nonlinear 
equations of the type 


f(ai,..-,@n) =0 (A7.1) 


over Z/pZ and the existence of monochromatic solutions of equations of this 
type. 

Csikvari, Gyarmati and Sarkozy [53] also studied both density and 
Ramsey-type problems involving equations of form (A7.1) over Z/mZ, N 
and Q. Among others they extended Schur’s theorem [35] by proving that if 
n,k € N and the prime p is large enough in terms of n and k, then for any 
k-colouring of Z/pZ the Fermat equation 

gl + y” — gh 
has a nontrivial monochromatic solution in Z/pZ. Moreover, they conjectured 
that for any & colouring of N the equation 


at+b=cd, a¥b (A7.2) 


has a monochromatic solution, and they proved partial results in this 
direction. Later Hindman [63] proved this conjecture in a more general form. 

P. P. Pach [78] studied the following questions: is it true that if k € N, 
and m € N is large enough, then the Eqs. (A7.2) and 


ab+1=cd (A7.3) 


have a “nontrivial” monochromatic solution in Z/mZ for any k-colouring of 
it? He proved that in case of equation (A7.2) the answer is affirmative, while 
in case of equation (A7.3) one needs further assumptions on the prime factor 
structure of m to ensure the solvability. 

Starting out from a problem of Pomerance and Schinzel, Sarkdzy asked 
the following question: is it true that for any r-colouring of the squarefree 
numbers greater than 1 the equation ab = c has a monochromatic solution? 
Pomerance and Schinzel [80] proved that the answer is affirmative for r = 2, 
and P. P. Pach [79] also proved this for r > 2. 


A8. Probabilistic Methods. The Theorems of Erdés and Rényi 


Dubickas [55] slightly sharpened Theorem 9 by showing that one can take 
c, = €7/10 and cg = 2e + € in the theorem for any 0 < € < 1/2. 
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Erdés and Rényi [13] also claimed in their paper that Theorem 10 can 
be extended from sums of two terms to sums of h terms (for fixed h), i.e., 
there is a similar theorem on B;,[\] sets in place of Bz[\] sets. However, for 
h > 2 independence issues arise which are not at all easy to handle. This 
problem was cleared by Vu [95] who gave a complete and correct proof for 
the following theorem: for h € N and h > 2, and any € > 0 there is a constant 
g = g(€) and a By [9] sequence A such that A(x) >> x!/"®, and, indeed, one 
can take g,(¢) < e~"*1. (See also the paper [52] of Cilleruelo, Kiss, Ruzsa 
and Vinuesa.) 

We remark that the probabilistic approach is used in many of the papers 
mentioned in this Appendix. 

At the end of Sect.8 we mentioned a few papers to appear soon; these 
papers appear as Refs. [57, 91] and [58). 

* 

We remark that the results described above induce many further prob- 
lems. In a subsequent paper we will return to some of these problems and 
also present some related results. 
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Math. Soc. (2) 8 (1974), 393-400 (with R. C. Vaughan). 

[E11] Prime polynomial sequences, ibid. (2) 14 (1976), 559-562 (with S. D. 
Cohen, M. B. Nathanson). 


[E12] On the greatest prime factor of [| f(k), Acta Arith. 55 (1990), 191-200 
k=1 
(with A. Schinzel). 


The papers [E1], [E2], [E8] and [E10] concern the coefficients of the 
cyclotomic polynomial 


on(x) = [[@ — 1)hGD, 


d|n 
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Let A,, denote the largest absolute value of the coefficient of ¢n(x). It was 
proved by E. Lehmer in 1936 that 


An = 2(n'/?), 
In [E1] Erdés proved that 
log An = Q((log n)*/*) 
and in [E2] that 
logn 

log log Ay, = Q {| ——— } . 

mi (= log -) 
In [E8] he gave a simpler proof of the last relation and conjectured that 
logn 
log log An — 
aad “jog logn 


for every c < log2 and infinitely many n. This conjecture even for c = log 2 
has been proved by R. C. Vaughan [11], his result is best possible. 

The paper [E10] treats the coefficient a,(n) of x” in ¢,(x). The authors 
prove that 


la,(n)| < exp(27/271/? + cyr/8) 
and 


lim sup |a,(n)| > exp(ca(r/logr)/?),  (r > ro) 


n—->co 


where cj, C2 are absolute constants, cp > 0, 


= ee) 


p prime 
The subject is still alive as shows Maier’s paper [4]. 
Somewhat related to the four Erdés’s papers discussed is the paper [E9], 
in which the authors consider the functional 
n 


[[@-2*) 


i=l 


M (a1, a2,.--,Qn) = oo 
z\|=1 


(a1 <a <...< Gy are positive integers) and 


f(n)= min M(ay,...,an). 
QA gooey an 
They prove that log f(n) = o(n). This result has been sharpened by F. V. 
Atkinson [1]. 

In the paper [E3] Erdés considers the sequence Q(n) first studied by 
A. Rényi. Let N(f) be the number of nonzero coefficients of a polynomial f 
and let 


Q(n) = min N(f?), 


Arithmetical Properties of Polynomials 265 


where f runs through all polynomials f € Q|z] with N(f) =n. Rényi proved 
that Q(29) < 29. Erdés proves that Q(n) « n° for a certain c < 1 and quotes 
the conjecture at which he and Rényi independently arrived, namely that 


lim Q(n) = oo. 

n—->co 
A good value of the constant c has been given by W. Verdenius [12] and the 
above conjecture has been proved by the writer [7]. The result of [7] has been 
improved in [9] to the form 


Q(n) > 2+ (n> 1). 


log 8 


The subject is still alive as shown in Coppersmith and Davenport’s paper [2] 
and Zannier’s paper [14]. 

The papers [E4] and [E12] concern the greatest prime factor P(f,x) of 
TIz_1 f(&), where f is an irreducible polynomial of degree d > 1. The first 
estimate for P(f,z) in the case f = x7 + 1 was given by Chebyshev and his 
result was extended to all relevant polynomials by T. Nagell in 1921 in the 
following form 


P(f,x) > c(f,6)a(log x)'~* 


for all e > 0 and a suitable c(f,¢€) > 0. 
In [E4] Erdés proved that 


P(f,2) > x(log x)°/) 8g log@ for c(f) > 0, x > ao(f) 
and asserted that by a much more complicated argument one can show that 
P(f,x) > xexp((logz)°) for 6 = 6(f) > 0, x > a(f). («) 
An attempt made in [E12] to reconstruct the proof of (*) led only to a 
weaker estimate 
P(f,x) > xexpexp(c(log log x)!/%) for « > x2(f) 


where c>0 is an absolute constant. However G. Tenenbaum [10] has 
succeeded in proving (*) with any 6 less than 2 — log 4. 
In the paper [E5] Erdés considers the sum 


S(a) = Yd f(k)); 
k<a 
where d(n) is the divisor function and f € Z[a] is irreducible. He proves that 
axloga < S(x) < aaloga, 


where C1, Cp are positive constants depending upon f. 

The upper estimate, which is much deeper has been considerably 
generalized by D. Wolke [13] He has replaced d(n) by a multiplicative function 
and the polynomial f by a quickly growing integer valued function, both 
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functions subject only to mild restrictions. The subject is still alive as shown 
by Nair’s paper [6]. 

In the paper [E6] Erdés considers values of a polynomial free from k- 
th powers. It was proved by G. Ricci in 1933 that a primitive irreducible 
polynomial of degree d > 1 represents infinitely many integers free from d-th 
powers. Erdés improves this as follows: every irreducible polynomial of degree 
d > 2 without a fixed (d — 1)-th power divisor greater than 1 represents 
infinitely many integers free from (d — 1)-th powers. This result has been 
improved by C. Hooley [3], who has given an asymptotic formula for the 
number of integers in question and also by M. Nair [5] who has replaced d— 1 
by [Ad] +1, where \ = V2 — 4. Hooley gives a tribute to Erdés by saying “It 
is to the perspicacity of Erdés that we owe our present appreciation of the 
manifold uses to which sieve methods can be put” (l.c., p. xi). 

[E7] is a survey of problems and results, in which in particular [E4], [E5] 
and [E6] are discussed. 

The remaining paper [E11] contains the following theorem. Let F' € Z[z] 
be a polynomial of degree d > 2 such that F(n) > 1 for all n > 1. Let 
Or = {F(n)}%,. Then F(N) is called prime in Of, if F(n) is not the 
product of strictly smaller terms in Op. If F(x) is not of the form a(ba + c)?, 
then almost all terms of Or are prime in Of. The “almost all” is indeed, 
quantified. For a later, related work, see [8]. 
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Dedicated to P. Erdés 


Summary. Since 1934 Erdés has introduced various methods to derive arithmetic 
properties of blocks of consecutive integers. This research culminated in 1975 when 
Erdés and Selfridge (Ill J Math 19:292-301, 1975) established the old conjecture 
that the product of two or more consecutive positive integers is never a perfect 
power. It is very likely that the product of the terms of a finite arithmetic 
progression of length at least four is never a perfect power. In the present paper it 
is shown how Erdés’ methods have been extended to obtain results for arithmetic 
progressions. 


1. History Until 1976 


In a letter to D. Bernoulli written in 1724 Goldbach argued that the product 
of three consecutive positive integers is not a square. In 1857 Mlle. A. D. 
proved that it is not any perfect power. In the same year Liouville showed 
by use of Bertrand’s postulate that x(a + 1)---(« +k — 1) is not a square 
or higher power if at least one factor z,x +1,...,u+k-—1 is prime, or if 
k > a —5. In particular, k! is not a perfect power for k > 1. In 1917 Narumi 
[17] showed that the product of & consecutive integers is not a square for 
2<k < 202. In the thirties Szekeres proved that it is not a higher power for 
2<k <9. For more details, see Dickson [1, pp. 679-680], Obldth [18] and 
Erdos [2, p. 194]. 

In 1939 Rigge [22] and a few months later Erdés [4] proved that the 
product of two or more consecutive positive integers is never a square by 
developing Narumi’s proof. They further proved that for fixed | > 3 the 
equation 


a(x+1)---(2+k—1)=7/! (1) 
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has at most finitely many solutions in integers x > 0,k > 2,y > 2. In 1940 
Erdos and Siegel jointly proved that there is an absolute constant c such 
that (1) has no solutions with k > c, but this proof was never published. 
In 1955 Erdés [7] published a different, elementary proof by which c could 
be computed. The elementary method was developed by Erdos and Selfridge 
[9] in 1975 for proving that the product of two or more consecutive positive 
integers is never a cube or a higher power. See Sect.2 for a sketch of the 
proof. 

The equation (1) has no solutions in integersx>0,k>1,1>1,y>1. 

Actually the results were more general. Rigge [22] showed that if all prime 
factors of b are not greater than sk, then the equation 


a(e+1)---(c+k—-1)=by' inintegers c>0,k>1,1>1,y>1 (2) 
has no solutions with | = 2. Erdés [5] showed that 


Grn 


k )=u in integers t>k,k>1,l>1,y>1 (3) 


is impossible for / = 3. Note that equation (3) is the same as equation (2) 
with b = k! and that it involves no loss of generality to assume that x > k, 


since 
r+k—-1 _ r+k—-1 
k 7 z-1 )° 


In 1951 Erdés [6] proved that (3) has no solutions with k > 4. Observe that 
(°?) = 1407. Erdés and Selfridge actually proved that if k > 3 and « +k is 
greater than the least prime > k, then there is a prime p > k which divides 
the left side of (1) to an order which is not divisible by 1. They conjectured 
that for some p > k the order should be one. The results suggest that it may 
be true that (2) has no solutions with kl > 6 for any b composed of prime 
factors < k. See Sects. 5, 7 and 8 for results in this direction. 

In his 1955 paper Erdés made an extension into a different direction, 
namely that if from the k integers x,“ +1,...,2+k—1 less than (1 — «) 
k log log k/logk are deleted, the product of the remaining numbers is never 
an I-th power provided that ¢ > 0, k > c(e),1>2 andn> k’. For 1 =2 it is 
allowed to delete ck/ log k numbers. See further Sect. 9. 


2. Sketch of the Proof of the Erd6és-Selfridge Result 
for k > 30,000, [ > 3 


The proof of Erdés and Selfridge is split into the following cases: 

k > 30,000, 1 > 3; 4 < k < 30,000, / > 3; 1,000 < k& < 30,000, / = 3; 
4<k< 1,000, =3;k =3,1>3;1=2. The latter case is the 1939 result of 
Rigge and Erdos. 
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The first case is a quantitative version of Erdés’ 1955-result. The proof 
in this case can be divided in four parts. Suppose 


a(x+1)---(c+k—1)=y!' inintegers 2>0,k>1,1>2,y>1. (4) 
Then we write 
a+j=ajz, (0<j<k) 


where a; is not divisible by any /-th power >1. If p divides both a; and aj, 

then p divides 7 — j and is therefore less than k, It follows that prime factors 

>k only occur in the z;’s and that the a;’s are composed of prime factors <k. 
The four steps are: 


) 
) The products aja;(0 <i < j < k) are distinct, 

(c) There are & — m(k) distinct a;’s with product less than k!, 
) The products a;a;(0 <i < j < k) cannot be distinct. 


We have a closer look at each step. 

Step (a). 2 > k!. 

By Bertrand’s postulate there is a prime in (=, rt+k— 1]. Ifa <k, 
then otk > « and this prime divides the left side of (4) to the first power, 
a contradiction. If x > k, then a result of Sylvester, rediscovered by Schur 
(cf. [2]), states that «(a +1)---(a+k—1) is divisible by some prime greater 
than k. Such a prime divides only one of the k factors, sox+k—1> (k+1)!. 
Thus x > k!. 

Step (b). The products aja; (0 <i < 7 < k) are distinct. Since ged(a+g, 
xt+i)<k< Vx for g £1 by (a) and similarly gcd(a + 9,2 + j) < Vx for 
g # J, we see that x + g does not divide (x + 7%)(a + 7). Thus the products 
(ec +i)(ea+j) (0 <i < j < k) are distinct. Suppose agan = aja; with 
O0<g<h<k,0<i<g<kand (14+ 9)(a@+h) > (a +i)(a+ 7). Then 

(v+g)(w+h)—(x+i)(x +7) = a0; (2'a}, — aia'). (5) 

The left side of (5) is smaller than (x +k)? — a? = 2ka +k? < 3kax by 

(a). The right side of (5) is larger than 


ajajl(aja;)'~! > Uaytiajal) (VM Sy, 
Thus, by (a) and / > 3, 
ka > “DI > 74/3 > ke 


a contradiction. 

Step (c). There are k — 7(k) distinct a;’s with product less than k!. Since 
ag = a; implies aga; = a,a;, it follows from (b) that the numbers a; (0 <i < 
k) are distinct. For every prime p < k, we choose an f(p) in {0,1,...,k—1} 
such that the power of p in a, for 7 = 0,...,4— 1 is maximal for 7 = f(p). 
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In this way we select at most 7(k) elements a;. The total number of factors 
p in the remaining a,;’s is at most 


je) 4 [APP A) |S) TS... 


p p p p yp? 


<E++El~ 


By counting the number of factors p in k! we see that 


k! = Tpcap!*/?, 


Thus the product of the not selected a,;’s is less than k!. This argument of 
Erdés introduced in his 1955 paper has turned out to be fundamental. 

Step (d). The products aja; (0 < i < j < k) cannot be distinct. Here 
Erd6s and Selfridge apply an elegant graph theoretic lemma to give a 
quantitative version of a result of Erddés contained in his 1955 paper. 
A subgraph of a graph is called a rectangle if it comprises two pairs of vertices, 
with each member of one pair joined to each member of the other. 


Lemma 1 ([9, p. 295]). Let G be a bipartite graph of s white and t black 
vertices which contains no rectangles. Then the number of edges of G is at 
most 8 + (ie 


Proof. Let s; be the number of white vertices of valency i, so )),., $i = 8. 
Since there are no rectangles, each pair of black vertices is linked to at most 
one white vertex. A white vertex of valency i corresponds with (5) black 


pairs. Hence 
Da(3) < (5 
é ss eae 
i>2 


If E is the number of edges of G, then 


B= Dis =5+ Di-dse+ Da() <s+ Gr 


i> i>2 i>2 


The lemma is applied as follows. Let uy < ug <--- <u, and vy < v2 < 
-++ <u; be two sequences of positive integers such that every positive integer 
up to x can be written in the form ujv;. If ay < -+-- < a, < & are positive 
integers such that all the products are distinct, form the bipartite graph G 
with s white vertices labeled u;,...,u; and t black vertices labeled v1,..., v% 
and an edge between u; and v; if ujvj = am for some m. Distinctness of the 
products aja; ensures that G' has no rectangles, so, by Lemma 1, 


ee 
r<s 9)° 
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Using this inequality Erdds and Selfridge show that the product of any k — 
m(k) of the a;’s exceed k! by proving that 


a; > 3.5694(¢ — 304) for i < 6,993 (6) 

a; > 4.3402(i— 1,492) for i> 6,993. (7) 

For (6) specific sets {u1,...,us} and {v1,...,v,} are constructed with 
t= 25 ands < Tas? +4, for (7) with t = 55 and s < eaoa® + 7. With the 


inequalities (6) and (7) a routine calculation using Stirling’s formula suffices 
to contradict (c) when k > 30,000. 

It is remarkable that Lemma 1 and a variant of it can also be used to 
derive a simple proof of a result that played a key role in Erdés’ 1955-paper. 
By applying the Cauchy-Schwarz inequality at the last line of the proof of 
Lemma 1 we obtain 


E<s+) (i-1 sy < s+ SoG -1)2s a fnset 2(5 ) v8 < sttvs. 


i>2 1>2 i>2 


This yields 


Lemma 2. Under the conditions of Lemma 1 the number of edges of G is 
at most s + ty/s. 


Remark 1. A slight modification of the proof yields the upper bound $8 + 


Ee ee. 


3. Integers with Distinct Products 


Denote by N(X) the maximum number of integers 1 < by < by <---b, < X 
so that the products b;b; (1 <i <j <r) are distinct. Since we can take all 
primes < X, the number N(X) can be as large as m(X) ~ X/log X. Erdés 
proved the striking fact that N(X) can hardly be larger. In [3], published in 
1938, he proved N(X) < 1(X)+8X°/4+4X1/2, in [7] he showed by a different 
proof that N(X) < 1(X)+3X7/8 +2X1/2, Finally, in 1968/1969, he proved 
n [8] that 


N(X) — 1(X) « n3/4/(log n)?/? 
and that there exist sequences such that 
N(X) — 2(X) > n9/4/(log n)3/?. 


We use Lemmas 1 and 2 to give an elegant proof of the following estimate, 
applying arguments due to Erdés. 


Theorem 1. N(X) < 1(X)+X7/8 4 X3/44 X12, 
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Proof. Let 1 < b; < bg < --+b, < X be such that all the products b,b; 
(1 <i<j <r) are distinct. We write b; = u,v; where v; is the greatest divisor 
of b; which is not greater than X‘/?. First of all it is clear that the numbers 
ULV , Uj V2, UQU,, U2V2 cannot all be b’s, since (w1v,)(ugv2) = (uyV2)(U2V1). 

Next we show if b; = u,v; with v; < X1/4, then u; must be a prime. For if 
not, let p be the least prime factor of u;. If p< X!/4 then pu; < X!/? which 
contradicts the maximum property of v;. Since u; is assumed to be composite 
we have p < X'/2. Hence X1/4 <p< X'!/2, But then p > vu; which again 
contradicts the maximum property of v;. Thus u; must be a prime indeed. 

As before we form a bipartite graph G with s white vertices u; and t black 
vertices v; and an edge between u,; and v; if b; = u;v; for some 7. Distinctness 
of the products 6;b; ensures that G has no rectangles. 

First we count the number of edges in G with u-value greater than X°/4. 
Since uv < X, the v-value is then less than X 1/4. We have shown that in 
this case u is prime. By Lemma 1 we find the following upper bound for the 
number of edges: 


1 
Ey <1(X)+ ge 


Secondly we count the number of edges in G with u-value in (1, X°/4]. The 
v-value is at most X!/?, By Lemma 2 we find for the number of such edges: 
p< x3/4 4 X7/8, 


The number N(X) is bounded by Ey + Ea < 1(X)+X7/8 + X9/4 4 X12, 


4. Arithmetic Progressions Composed of Small Primes 


We consider the generalisations to finite arithmetic progressions. In step (a) it 
was shown that x is large compared with k. It sufficed to show that x(a + 1) 
-+-(x +k —1) was divisible by a prime p > k. In this section we consider 
the greatest prime factor of the product of the terms of a finite arithmetic 
progression. 

Let x,d and k be positive integers. We consider 


A := A(a,d,k) := «(a+ d)--- (a + (k— 1)d) (8) 


and in particular P(A), the greatest prime factor of A Bertrand’s postulate, 
proved by Chebyshev, states that for any positive integer k the sequence 
k+1,k4+2,...,2k contains a prime, that is P(A(k +1,1,k)) > k. In 1892 
Sylvester [32] generalised this inequality by showing P(A(a,d,k)) > k for 
x >k+d. Langevin [14] proved that P(A(a,d,k)) > k for x >k. 

If d = 1, the problem of determining the best lower bound for 2 is 
equivalent with the classical problem how large gaps between consecutive 
primes can be. Using results on this gap problem the condition « > k can 
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be replaced by « > k/13 when k > 118 by a result of Rohrbach and Weis 
[RW] and by x > k°48 when k is sufficiently large by improvements of the 
theorems of Hoheisel and Ingham (cf. [21, p. 193]). 

For d = 2,3,4 and 6 Breusch, Molsen, Rohrbach and Weis, and Erdés 
derived extensions of Bertrand’s postulate. References can be found in Moree 
[16] who extended the results to 54 values of d less than 1,000. Each extension 
provides an improvement of Langevin’s result for that value of d. The authors 
showed in [28] that 


P(A(a,d,k))>k if d>1 and (zx,d,k) ¥ (2,7,3). (9) 


They applied a sharp upper bound for 7(a) due to Rosser and Schoenfeld. 
In [31] they further proved that 


P(A(z,d, k)) > kmin (iogtoe(a + (k — 1)d), log (4 + ‘)) 


and that, for any ¢ > 0, 
P(A(a,d,k)) >- kloglog(a+(k—1)d) for x+(k—1)d> k(logk)*. 


These results are based on upper bounds for the solutions for Thue-Mahler 
equations due to Gyéory. 
The stated results enable us to conclude: 


Theorem 2. The equation 
a(x +d)---(¢+(k—1)d) =y/ (10) 
has no solutions in integersx >0,d>0,k>1,1>1, y>1 with Ply) <k. 


Proof. For d= 1 Theorem 2 follows from Theorem 1, for d > 1 from (9). 


5. Perfect Powers in Products of the Terms of an 
Arithmetic Progression, I 


A natural generalisation of equation (1) is 


a(x+d)---(a@+(k—1)d)=y' inintegers x >0,d>0,k>2,1>1,y>1. 

(11) 
Without loss of generality we assume that / is a prime number. Theorem 2 
implies that P(y) > k. For any numbers z,d and k it is easy to find a positive 
integer A such that the left side of (11) with x replaced by Az and d replaced 
by Ad represents a perfect power. To avoid such solutions we shall assume in 
the sequel that ged(x, d) = 1. 

Soon after he proved his joint result with Selfridge, Erddés conjectured 
that (11) implies that k is bounded by an absolute constant and later he 
conjectured that even k < 3. The theory on the Pell equation yields that there 
are infinitely many pairs x,d with gcd(«,d) = 1 such that «(a+ d)(a+ 2d) is 
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a square. On the other hand, Euler [10] (cf. [1, p. 635]) has shown that (11) 
has no solutions with k = 4,1 = 2 and Oblath [19, 20] has proved that (11) 
has no solutions with (k,1) = (5, 2), (3,3), (3,4) and (3,5). 

Marszalek [15] showed that k is bounded if d is fixed. More precisely, he 
proved that (11) implies that k < exp(C,d°/) if | = 2, k < exp(Cod"/2) if 
1 =3, k < C3d°/? if 1 = 4, k < Cyd if | > 5 with explicitly stated absolute 
constants C1, C2, C3, C4. 

We shall consider the more general equation 


a(x +d)--- (a +(k—1)d) = by’ in integers r > 0,d>0,k > 2,6>0,1>1,y>1 
(12) 


subject to gcd(a,d) = 1, P(b) < k, l is prime. One of us conjectured in 
[33, p. 219] that (12) has only finitely many solutions with kl > 6. Shorey 
[26] applied the theory of linear forms in logarithms to show that (12) 
with / > 3 implies that k is bounded by a computable number depending 
only on P(d). Shorey and Tijdeman [27, 29] derived various improvements. 
For example, they showed that there exist effectively computable absolute 
constants C's, C6, C7,Cg,Cg such that, for k > Cs, 


(a) d, > Cgk!-? (b) d, > k:C7 log log k 


(c) P(d)>Cslogkloglogk (a) > GkG>2) (3) 


where d, is the maximal divisor of d such that all the prime factors of d, 
are = 1 (mod /) and w(d) denotes the number of distinct prime factors 
of d. Since di|d, the results (a) (cf. [27, (2.7)]) and (b) (cf. [29, Theorem 
1 and (2)]) provide considerable improvements of Marszalek’s estimates. In 
particular k < d® for any ¢ > 0. Inequality (c) (cf. [29, Corollary 1]) shows, 
for 1 > 1, how k& can be bounded in terms of P(d). Inequality (d) (cf. [27, 
Theorem 1 and Corollary 1]) shows that for fixed / > 2 the number & is even 
bounded by a computable number depending only on w(d). 

First we shall illustrate the method by showing how the extension of 
Erdés’ ideas leads to a proof of (13.a). 


Theorem 3 ([27, (2.7)]). Let (12) hold subject to gcd(x,d) = 1, P(b) <k, 
1 is prime and let d, be the maximal divisor of d such that all the prime 
factors of d, are =1 (mod 1). Then, for k sufficiently large, 


d, > k-?, 


Proof. We assume | > 2. We compare with the steps in Sect. 2. 
Step (a) We show that x+ (k—1)d>k’. 
Write 


a+jd=Ajyi (0<j<k) (14) 


where A; is composed of primes < k and y; of primes > k. Since P(y) > k, 
one of the terms of the arithmetic progression is divisible by an /-th power of 
a prime > k, whence 
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g+(k-l)d>ki and x+d>hk'-, (15) 


Step (b) We show that we may assume that the A;’s with j > k/8 are distinct. 
Suppose that A; = A; for some i > j > 0. Then, by (14), 


(i — j)d = Aj(y; — 95) (16) 


Since gcd(x,d) = 1, we see that A;|(i — 7) whence A; < k. Hence, by step 
(a), yi > k and y; > k. On the other hand, d|(y} — yj). It is well known that 
every prime factor of 
Uw 

Ya 7 95 (17) 

Yi — Yi 
is either / or = 1 (mod 1) and that J occurs at most to the first power in (17). 
Consequently 


yi — Yj 2 a/ldy. (18) 


Now from (16) we derive 
kd > (i j)d > AW" (yi — yy Aga)" 
If 7 > k/8, then we obtain, by (15), 


d k (l-1)/I d ot 


which implies d, > ski-?, Thus we may assume that the numbers A; with 
i > k/8 are distinct. 
Back to step (a) We show x + (k —1)d > k'+1/4. 

If y; = 1, then x + id is composed of primes < k. By the argument in 
step (c) of Sect. 2 there are at most 1(k) selected A;’s and the product of the 
remaining A;’S is at most k!. Therefore, by (15) and / > 2, the number of i 
with 0<7i<k and y; = 1 is bounded by 


log(k!) 2k k 5k 
k)4 x 
ON) ee ieee Ta 
for k large. Let Sp denote the set of elements A; with i > k/8 and y; > 1. 
Then |So| > k/4. Note that y; > k for A; € So. Hence 


1 
—1)d> Ag) >=" 1 
z+(k ja eee yi) = ; (19) 
and 
1 I 
w+d> 5k. (20) 


Back to step (b). We show that we may assume that the A,’s withO <j <k 
are distinct. 
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Suppose that A; = A;, but 1 > j > 0. Then, as in step (b) above, but 
now with (20) instead of (15), 


1d 


d 
(I-1)/l I-1 
dk > 7 (e+) ae 


1 
which implies d; > ah 
Note. We now turn to steps (c) and (d) and only later deal with step (b). 

The argument given in step (c) of Sect. 2 applies similarly to the A,’s: 
Step (c). There exists a subset T of {0,1,...,4 — 1} consisting of at least 
k — m(k) elements such that []j-¢ Aj < ki. 

It follows that the average value of these A;’s is at most (k/e)*/(#-™()) 
which is asymptotically equal to k. A straightforward application of the box 
principle would yield that for any 7 with 0 < 7 < 1 there are at least 
nk numbers j > 0 such that A; < k‘/", This estimate is too rough for 
applications. The following lemma provides a useful sharpening of this bound 
when the set {A;},jer is large. 


Lemma 3 (cf. [27, Lemma 6]). Let 0 < 9 < 1. Put 5) = {Aj}jer. 

Suppose 

begs 
log k 

where g < (l1—1)logk. Then there exists a subset Sp of S1 with at least np 

elements satisfying 


[Si] > (21) 


Aj <Ck for Aj E So (22) 
where C' = exp((g +7 + .37)/(1 — — g/logk)). 


Proof. Let Sz be the subset of S; defined by (22). By steps (b) and (c) we 
have 


k! > Taes, A > (|S2|)!(Ck) 52! 1521, 


Suppose |.S2| < 7k. Then, by n! > (n/e)" for n = 1,2,... and the fact that 
(y/x)¥ is monotonic decreasing in y for 0 < y < w/e and (21), we obtain 


|S2| \ \TE ke oe 
I> | — = Il > (2 k— Togk 
Ri 2 (4 (Ck) 2 ia) oN 


k 
ee ee 
e1C7C39/ log keg , 


Since 7” > e~-3" for 0 <1 <1 and k* > k!, we obtain a contradiction. 


Step (d). The products A;A;(0 <7i< j < k) cannot be distinct. 
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By step (b) we know that the numbers A; (7 > 0) are distinct. Hence the 
set T in Lemma 3 consists of at least k — 1—(k) > k — 2k/logk elements 
for k large. We apply Lemma 3 with g = 2 and 7 = 1/3. Then we find a set 
Sy of at least k/3 elements A; satisfying 


Aj < 60k for A; € So 


for k large. By (20), y; > k for A; € Sp. We write S3 for the set of all A; € S 
with 
k k 
9 and A; > 5° (23) 
Then |S3| > &/10. It follows from Theorem 1 with X = 60k that the products 
A;A; with A;, A; € S3 cannot be distinct for k large. 
Step (b). The products A; A; (A;, A; € S3) are distinct. Since ged(#+ id, 7+ 
jd) divides both (i — j)a and (i — j)d and gced(x,d) = 1, we have gcd(a + 
id,x+jd) < |i—j| < k fori £ 7. Hence, by (20), ged(a+gd,xr+id) <k< Jz 
for g #7 and similarly ged(a + gd,x+ jd) < /z for g # j. It follows that the 
products (a + id)(a + jd) (0< i<j <k) are distinct. 

Suppose there are elements Ag, Ap, Ai, Aj of S3 satisfying A,Ap = A; A; 
with g i and g #7. Without loss of generality we may assume that 


(x + gd)(x + hd) — (a + id)(a + jd) = AiAs((ygyn)’ — (yiys)’) (24) 


is positive. Since d divides the left side and gced(d, A;A;) = 1, we see that d 
divides the difference of the l-th powers. Hence, by (18), 


i> 


d 
=i > 
Ygh — Vili Id 
Hence the right side of (24) is at least 
d 
(Ai Ag)! 5-((Aiyi)(Agyy)) 0" 
1 


Since A;, A; € S3, we obtain from (23) the lower bound 
R\2/ g h\2U-D 4 g 
d > Re + kd)?" 
(=) a (2+ 5a) Zgg er ke 


The left side of (24) is at most 2kd(a + kd). Comparing both bounds we 
obtain, by (19), 


atkd\'?/) 4 
of oe ee > LK-?2, 
162d, > ( i ) a 


This completes the proof of Theorem 3. 
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6. Some Applications of Brun’s Sieve 


Erdés has introduced some applications of Brun’s sieve which are used in the 
proofs of (13.b), (13.c), and (13.d), but have also interest in themselves. Let 
0 < z< X. Brun’s sieve implies that for any A > 0 the number ®(X,w) of 
integers < X free of prime factors < w satisfies 


1 
®(X,w) <a XI p<w(1——) for w< X4 
a Pp 
(cf. [13, p. 68]). Since II,<y(1 — a) < >i, we conclude that 


XxX 
@(X,w) <,——__ for w< XA: (25) 
log w 


Erdés [5] applies (25) in the following way. 


Lemma 4. Let b;,...,b;, denote all integers between z and X such that 
every proper divisor of b;, is at most z. Then 
xX 


*S jos(X/2) 


Proof. If z > VX, then b1,...,bs are prime numbers and the result is obvious. 
We assume z < VX. Put y = X/log(X/z). If bj, is larger than y, then every 
prime factor of b;, is greater than y/z. By (25) the number of such 0; is at 
most 
< = 
log(y/z) 
The number of 6;’s not exceeding y is at most y. Hence 
Z xX xX 
Ss ; 
log(X/z) — log(y/z) 


Since log(y/z) = log(X/z) — log log(X/z) > 4 log(X/z), the result follows. 


Lemma 4 can be used to show that in a dense sequence some numbers 
have a large common divisor. 


Lemma 5 (cf. Erdés [5]). Let r,z,X be integers with 0 < z < X. Let 
0O< ay < ag <-:+ <a, < X be a sequence of integers. Then there are at 
least 
cy xX 
-— 2- — 26 
"~*~ ioa(xa) - 


pairs a;,a,; for which gcd(a;,a;) > z where cy is some absolute constant. 


Proof. Denote by 6;,...,6, all integers between z and X having every proper 
divisor < z. Then, by Lemma 4, 
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cy xX 


* = Jog(X/z)’ 
Obviously every integer between z and X has a divisor among the b’s. Hence 
there are at least r— z — s pairs a;,a,; for which gcd(a;,a,;) is divisible by a 
b, whence ged(a;, a;) > 2. 


Remark 2. [fr > z+ 2s, then a better bound is possible. Adjoin to each a; 
larger than z one divisor b. If the number of a;’s corresponding to b; equal t;, 
then the number of pairs is (oy; Hence the total number of pairs is >;_, Cy. 
By the Cauchy-Schwarz inequality we obtain 


Ss tj it s ; 1 s 1 s 2 it Ss 
=x t— = if ti) —-—-x ty 
Dae es) 
w=1 w=1 t=1 w=1 t=1 
Hence we obtain, since gy t; >r—2z> 2s, 


Vo G) eRe Zz" 5 (r = (0 z)\(r—z—s). 


i=l 


Thus the bound (26) can be replaced by ~(r — z)(r — 2-8) where s = 
c,.X/log(X/z). 


Another result which can be derived by sieve methods and is used in the 
proofs of (13) is the Brun-Titchmarsh inequality. Let m(a;k,l) denote the 
number of primes < x which are =1 (mod k). Then, for sufficiently large «, 


3x 
m(a;k,l) < Hh) logta/®) (cf. [13, p. 110]) (27) 


where ¢(k) denotes Euler’s indicator function. 


7. An Application of Evertse’s Bound for the Number 
of Solutions of the Equation Az! — By! = dz 


In 1939 Erdés [5] also applied the following special case of Thue’s Theorem 
to prove that for ] > 2 and k > ko(l) the equation 


a(x +1)---(c+k—1)=y) 


is impossible. 

The number of solutions of Ax! — By' = C, where 1 > 2 and A, B,C are 
given positive integers, is finite. 

Evertse [12], cf. [11], has calculated bounds for the number of solutions 
of Az! — By! = C. Let R(l,C) denote the number of residue classes Z 
(mod C) with Z' = 1 (mod C). Evertse proved that the number of solutions 
of |Az!' — By'| = C in positive integers x, y with gcd(x, y) = 1 is bounded by 
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aR(l,C)+6 where (a, 3) = (2,4) if l= 3, (1,3) if = 4, (1,2) if 2 =5, (1,1) 
if 7 > 6. It turns out that the following estimates, proved similarly, are even 
more useful for our purpose. 


Lemma 6 (Evertse, [12, pp. 17—18]). The number of solutions of 


Aa! — By! =dz (A,B,d€ Zso,1 € Zsa, gcd(A,d) = ged(B,d) =1) (28) 


in integers x > 0, y > 0, z subject to gcd(x,y) = 1 and 0 < |z| < d@/5)-! is 


bounded by aR(l,d) + 6 where 


(3,6) if l= 
(2,2) ifl=4 
(a, B) = ¢ (2,1) if 1=5,6 
(1,3) if l= 
(Laie 8 


It follows from the data at p. 18 of [12] that R(I,d) = 1’°%) when 1 is 
an odd prime, where d, is the greatest divisor of d composed of primes = 1 
(mod 1). 

We shall demonstrate how Lemma 6 can be applied to (12) We are going 
to prove (13.d) in the sharpened form 


yw (a1) > Cok (1 > 5). 
logk 
Without loss we may therefore assume that [”(4) < ae If A; = Aj, for 
some 7 > j, then 


Aj(yj — yj) = (i — jd. 
Since gcd(A,;,d) = 1, we obtain A,|i — 7. Hence 
yi — y; = 2d 


where 0 <z<k <« d!/('—?), the latter inequality by Theorem 3. Since y; and 
y; are composed of primes > k& and are coprime to d, we have gcd(y;, y;) = 1. 
Furthermore, for k sufficiently large and 1 > 3, |z| < d?/5 — 1. On applying 
Lemma 6 we find that the number of pairs (i,7) with i > 7 and A; = A; is 
at most al”’(%) + 8 which is less than .9k/ log k by our supposition. 

The argument given in step (c) of Sect. 2 applies to the A;’s. Hence there 
exists a subset T of {0,1,...,4 — 1} consisting of at least k — ea — 1(k) 
elements A; such that [[,- Aj < k!. By the Prime number theorem we have 
m(k) < 1.1k/logk for large k. Hence we can apply Lemma 3 with g = 2 and 
n= z. As in Step (d) of Sect.5 we find a set 5S of at least k/3 elements A; 
satisfying 


A; <60k for A; € So 
for k large. Note that 
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Ais — Ajy; = (2 + id) — (x + jd) = di — j) 


so that (y;,y;,7—J) is a solution of (28) for A= A;, B= Aj, z=i-j<k. 
The number of possible pairs A;,A,; is too large to make a straightforward 
application of the Box principle to apply Lemma 6. However, we have 
Lemma 5 at our disposal. We apply this lemma with r = k/3, z = ek, 
X =60k and {a;}%_, is the ordered set of A; with A; <60k, where « > 0 
is so small that 60/log(60/e) < 3. We obtain that for k sufficiently 
large there are at least k/4 pairs A;, A; for which gcd(A;,A;) > ek. Since 
gced(A;,d) = gced(A,;,d) = 1, we can divide by ged(A;, A;) to arrive at an 
equation 


where B;, B; are positive integers bounded by 60/e and |Z;,;| < 1/e. Hence 
we have at most (60/e)? equations 


Aa! — By! = dz 


with in total at least &/4 solutions (x, y, z). By the Box principle at least one 
of the equations has at least (¢/120)?k solutions. We know from (13.a) that 
d is sufficiently large, whence [z] < d?!/5-!. We conclude from Lemma 6 that 


e\2 &k 
~)\) _* < qv) 7 
Can, logk ~ oH 


In this way we derive the following improvement of (13.d): 


Theorem 4. Under the conditions of Theorem 3 we have 


k 
AE 125). 29 
ee (2d) (29) 
For a slightly stronger form of (29) and for a proof not depending on 
Evertse’s result, we refer to [27, Corollary 1]. 


8. Perfect Powers in Products of the Terms 
of an Arithmetic Progression, III 


In Sect.5 we have shown how (13.a) can be proved and in Sect.7 we have 
derived a refinement of (13.d). We first show how these results can be used to 
obtain formulas of the form (13.b) and (13.c). Afterwards we say something 
on the remaining case | = 2. 

By Theorem 4 we have I”(%) >> k. Since every prime factor of d; is = 1 
(mod 1), the greatest prime factor P(d,) of d; has to be at least Iw(d,). By 
using the Brun-Titchmarsh theorem we even find that 
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If 1 > (logk)?, then (13.c) holds trivially. If 1 < (logk)?, then 
log k log k 
P |—-— 
ai) > log! “ log! 
Ifd > log log k, then (13.b) follows from (13.a). If < log log k, then Theorem 4 
implies that, for some constant c, 


dy = (cw(d,))"“ 


> log k log log k. (13.c) 


whence 
log k log k log log k 
—— log log k > ———>——_ 
log! cas ed log log log k 
which is slightly weaker than (13.b). 
The proofs given until now do not include the case | = 2. The proofs for 
1 = 2 start in a different way, namely by writing 


log d, > 


z+id = ax? (@¢ =0,1,...,k-1) (30) 


where a; is squarefree (cf. Sect.2). Hence x; can contain prime factors < k, 
but also in (30) we have P(a;) < k. It is now much harder to show that the set 
f{a;}*=g has at least k — ck/ log k elements. Because of the fact that the a;’s 
are squarefree, the product of the a;’s is essentially larger than [k —ck/ log k]! 
and this lower bound contradicts an upper bound, due to Erdés and Rigge, 
obtained by saving the powers of 2 and 3 in the product. This contradiction 
yields the results (13.b), (13.c), and (13.d) for 1 = 2. For details see [27, 
Sect. 3]. 

Looking once again at Erdés’ conjecture stated in the Abstract, we see 
that for difference d > 1 all results up to now are for length k sufficiently 
large. It follows from (13.a) and (13.b) that & should be much smaller than d. 
As long as we cannot disprove that there are arbitrarily large arithmetic 
progressions consisting of [-th powers there is no hope to prove Erdos’ 
conjecture. However, there is new hope to settle the latter assertion after 
Wiles’ announcement of a proof of Fermat’s Last Theorem. 


9. Generalisations and Extensions 


It will be obvious from the preceding proofs that it is not necessary to require 
that all terms x + id (i = 0,1,...,4—1) are almost-powers Ajy!. It suffices 
if there are enough almost-powers among them. As mentioned in Sect. 1 the 
question how many such numbers are required was first investigated by Erdés 
in 1955. Shorey [25] completed Erdés’ result by showing that also for | = 2 
the equation 


(x + d,)(a + dz)---(a +d) =y/ (31) 
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in integersa >0,t>0,/>1,y>1,0<d, < dg <---<d, < k implies 
that 


log log k 


t<k—-(1—-e)k (32) 


log k 
for k > ko(e),¢ > 0. The proof depends on the finiteness of integral solutions 
of hyper-elliptic equations under suitable conditions. Moreover, he sharpened 
Erdés’ bound (32) for 1 > 2 considerably. Combining the elementary method 
of Erdés with the theory of linear forms in logarithms, irrationality measures 
of Baker for algebraic numbers and the method of Halberstam and Roth 
on I-free integers, Shorey [24, 25] proved that, for k sufficiently large and 
1>2,t< mk with 


AT 45 2 vad 
= = > : 
V3 56? V4 6a" y< 3 for 1>5 and y< for | sufficiently large 
The authors [30] derived some results for the general equation 
(x + dyd)(x + dod) -- +(x + dd) = by’ (33) 


where b and d are positive integers with P(b) < k and gcd (x, d) = 1 and the 
other unknowns are as above. For example, they showed for any ¢ > 0 that 
(33) with | = 2 and k > ky (ce, w(d)) implies 
log log log k 

log k 
and that (33) with prime 1 > 2 and k > ko(e) and 1M <, kh(k)/logk 
implies that 


t<k—(l-e)k 


t<k-(1- 
Sh=(l—¢) log k 
where 
_ flogloglogk ifl=3 
nye { loglogk if 1>5. 
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1. Introduction 


Les fonctions arithmétiques associées aux systemes de représentations d’en- 
tiers, comme le développement dans une base donnée, satisfont généralement 
des relations de récurrence qui facilitent considérablement l’étude de leur 
valeur moyenne. Considérons par exemple la somme des chiffres en base 2, 
que nous désignons par a(n). On a 


a(2n) = o(n), a(2n4+1) =a(n)4+1 (n > 1), (1) 
d’ot il découle que la fonction sommatoire S(m) := )locmen 7(m) satisfait a 
S(2n) =n+2S(n), S(2n+1)=n+o(n)+2S(n) (n> 0). (2) 


En particulier, si l’on pose y(n) := S(n) — (nlog n)/(2log2) (n > 1), la 
premiere relation (2) implique y(2n) = 2y(n) pour tout n, de sorte que l’on 
peut écrire 


_ logn _ niogn | logn 
ln) = no (P82), oO) Qlog2 | no (Per). 2) 


ou G est périodique de période 1. 
Dans la quasi-totalité des exemples connus, on obtient de méme, sans 
trop de difficulté, une formule du type 


So dm = P(n) + Q(n)G(log n) + R(n) (4) 
m<n 
pour une fonction arithmétique donnée {a,}°°_,, o&1 P et Q sont des 
fonctions régulieres — typiquement des combinaisons linéaires de produits 
de puissances n® et de puissances de logarithmes (logn)°’—, R(n) est un 
terme résiduel, souvent périodique et/ou borné, et G est une fonction 
oscillante périodique a caractére fractal, en général continue. 
Trollope [41] a donné, pour la fonction G de la formule (3), une formule 
explicite impliquant en particulier qu’elle est continue, et partant bornée. 
Utilisant une méthode plus simple, Delange [10] a généralisé le résultat au cas 
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de la somme des chiffres en base g > 2 quelconque, que nous notons g,(7), 
soit 


-—1 logn 
» og(m) = Fogg en + nGa (=) (5) 


m<n 


ot. G, est continue et 1-périodique. Il montre en outre que Gy n’est nulle part 
dérivable et détermine son développement de Fourier. 

A cété de celles de Trollope et Delange, plusieurs autres techniques sont 
en fait susceptibles de fournir le calcul explicite de G,. Nous utilisons au 
paragraphe3 une approche assez générale fondée sur l’intégration complexe. 
Voyons ici, par exemple, comment fonctionne, dans le cas q = 2, celle de 
Brillhart, Erdés et Morton dans [2]. Soit « un nombre réel positif, dont le 
développement en base 2 est 

£= S- €,/2", 


r>0 


avec €9 € N,e, = 0 ou 1 pour r > 1, et ¢, #1 pour une infinité de valeurs 
de r. On pose 


Le = |2*a| = S- re ag et ha 


O<r<k 


S(t,) xlog(2*z) 
2k 2 log 2 


Ona vp, = 2a%~~-1 +€x. Grace a (2), on en déduit par un calcul de routine que 


E-0(Xk-1) Er 
Ti, — Tr-1 = Qk » gr+l’ (6) 


rok 


ce qui implique l’existence de v(x) := limp... Ty. De plus, par itération puis 
passage a la limite en k, la relation (6) fournit 


a 


Lorsque x =n €N, onaég =n,€, = 0 (r > 1) et Von retrouve bien (3). 
De plus, lorsque € est un rationnel dyadique positif de dénominateur réduit 
2™(m > 0), ona 


lim o(#,) = ee : a) 


aE 


ae ér(a) = 4 €,-(€) —1 (r =m). 
1 
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Cela permet de vérifier facilement que y est continue sur Rt. La fonction 
G de (3) peut donc étre prolongée en une fonction continue sur R* par la 
formule 


G(u) = o(2")/2". 


Puisque les quantités (logn)/log2 sont denses modulo 1, la propriété de 
périodicité observée plus haut est encore valable pour le prolongement. 

La littérature abonde en exemples de situations similaires — ainsi qu’on 
pourra s’en convaincre a la lecture de notre bibliographie, issue de celle 
rassemblée dans la thése de Cateland [5]. Désignons, conformément a l’usage, 
par qg-noyau d’une suite {a,}°2, l’ensemble des sous-suites 


{nN 4 dginrp ik 20,0< 7 < q*}. 


La généralisation naturelle de la propriété (1) est celle des suites 
g-automatiques, i.e. dont le q-noyau est fini, voire des suites q-réguliéres, 
c’est-a-dire dont le q-noyau engendre un module de type fini — cf. [1]. 
La quasi-totalité des exemples connus reléve effectivement de ces deux 
définitions. 

Dans cette note, nous nous intéressons plus particulierement a la non- 
dérivabilité des fonctions fractales G apparaissant dans des formules de 
type (4). Peu de résultats généraux sont disponibles dans cette direction. 
Les travaux les plus significatifs sont ceux de Dumont-Thomas [11-13] et 
Cateland [5]. On distingue essentiellement trois types de méthodes : d’une 
part celles qui exploitent l’expression exacte de G(x), généralement sous la 
forme d’une série liée a la représentation de x dans un systéme adéquat, 
d’autre part celles qui établissent l’existence d’équations fonctionnelles pour 
G (c’est en particulier la voie explorée par Dumont et Thomas), enfin 
celles qui utilisent les divers renseignements disponibles sur les coefficients 
de Fourier de G — ce qui ne fournit en général qu’une preuve de la 
non-dérivabilité presque-partout, et non partout. Nous nous proposons de 
développer ici une quatrieéme approche, sans doute la plus naive de toutes. 
Elle consiste 4 “oublier” la définition explicite de G pour ne retenir que la 
formule (4): comme Ie membre de gauche est arithmétique, donc irrégulier, il 
est naturel d’attendre que le membre de droite contienne lui aussi un certain 
degré d’irrégularité — qui doit alors étre nécessairement le fait du terme 
fractal G(logn). Il reste ensuite & opérer un “relevement” des valeurs de 
la variable, en transportant les propriétés des G(logn) aux G(x) ot x est 
un nombre réel quelconque. II] est vraisemblable que ce principe puisse étre 
formalisé dans un contexte assez général. Nous nous contentons ici de le 
mettre en ceuvre dans trois cas particuliers importants de la littérature. 

Le premier exemple est celui de la suite de Newman-Coquet 


Cn = (fee, (8) 
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Coquet [7] établit la formule sommatoire 


logn 
; =a L 
Cm =n Go (7) + 31(n), (9) 


m<n 


ot. Go est continue et |-périodique, et ot l’on a posé 


aoe log 3 ice 0 si n est pair, 
~~ log 4’ ~ | (-1)7@"-9) si n est impair. 


Nous montrons, directement a l’aide de (9) et sans utiliser expression de Go, 
le résultat suivant, qui est d’ailleurs implicitement contenu dans la preuve de 
Coquet de la non-dérivabilité de Go. 


Théoreme 1. La fonction Go nest dérivable pour aucune valeur de x € R. 
Plus précisément, on a pour tout x € R 


Go(a +h) —Go(x) =Q(\A|”) — (h 0). (10) 
Nous considérons ensuite la suite de Rudin-Shapiro 
ry = (-1)™, avec e(n) := S- €j€j41 si n= S- ej 2), 
j20 j20 
Brillhart, Erdés et Morton établissent dans [2] la formule sommatoire 


S- tm = VnG (725) (11) 


m<n 


ou G est 1-périodique, bornée, et continue sauf aux points (log n)/ log 4, 
n € N*, et prouvent que G; n’est dérivable en aucun point x normal en 
base 4. Dans [12], Dumont et Thomas montrent que cette derniére restriction 
est inutile. Notre approche directe fonctionne ici trés simplement et fournit 
le résultat suivant. 


Théoréme 2. La fonction G, n’est dérivable pour aucune valeur de x € R. 
Plus précisément, on a, pour tout x € R, 


Gi(e@+h)-—Gi(z) =Q(V|A]) — (h 0). (12) 


La troisieme application concerne les suites digitales, introduites par 
Cateland [5], et qui sont une généralisation de la somme des chiffres en base q. 
Pour q > 2,¢ > 1, notons E(q,2) ensemble des ¢-uples ¢ = (€0,...,€¢-1) 
avec €; € {0,...,g— 1} pour tout j. Pour n EN, on écrit n = 2, €;(n)q? 
le développement de n en base q et l’on pose 


Ex(n) := (ex(n),.--,en4e-1(n)) € E(q, 2) (k =0,1,...). 


On note encore 
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avec la convention e(n;0) = 1 pour tout n. La fonction o(n;¢) est donc égale 
au nombre d’occurrences du mot ¢ dans la représentation q-adique de n. 
Etant donnée une fonction F : E(q, £) > C telle que F(0) = 0, on définit une 
suite digitale {ur(n)}°2_) par la formule 


ur(n) =) F(ex(n))= SY) Fle)o(nie). (13) 


k>0 E€B(q,e) 
On retrouve la suite o,(n) en choisissant 2 = 1 et F égale A l’identité. 
Cateland a établi, par la méthode de Delange, la formule sommatoire générale 
I 
S~ up(m) = Apnlogn +nGr ( | + 6p(n) (14) 
hope log q 
avec 
= Tingg PO 
= lac ’ 
q log q E€E(q,£) 


et ot! Gr est continue et 1-périodique, et dp est q’~!-périodique. Nous 
donnons une preuve assez simple de ce résultat au paragraphe3. Notre 
objectif principal consiste 4 déduire de (14) le résultat suivant de non- 
dérivabilité, qui étend optimalement celui de Cateland. Comme nous le 
verrons, la démonstration, reposant sur le principe énoncé plus haut, est 
extrémement simple. 


Théoréme 3. Soient q > 2, > 1,F : E(q,l) > C, et {ur(n)}%9 la 
suite digitale correspondante. Une condition ncessaire et suffisante pour que 
la fonction 1-périodique Gr associée soit nulle part dérivable est qu’il existe 
un entier a > 1 tel que ur(q’1a) £0. 


Le résultat de Cateland était conditionnel 4 ’hypothese Ar 4 0. Lorsque 
ur(q 1a) = 0 (a > 1), ur est périodique, Ar = 0, et la fonction Gp est 
constante. 

L’auteur tient ici 4 remercier Jean-Paul Allouche pour son aide précieuse 
lors de la préparation de cet article. 


2. Démonstration des Théorémes 1, 2 et 3 


Prouvons d’abord les Théorémes 1 et 2. Soit x € [0,1[. Nous écrivons le 
dveloppement 4-adique de 4*, soit 
a = Soe /4, 
jZ0 
avec 0 < ej; < 3 pour tout j et e; #3 pour une infinité d’indices j. Ensuite, 
nous définissons, pour k > 0, les nombres réels x, y, et V’entier nz par les 
formules 


294 Gérald Tenenbaum 


re = ret = ST eA I, om tL aunt, (15) 
O<jxk 


En écrivant (9) pour n = ng et n = ng +1 et en effectuant la différence, il 
vient 


Cn, = (Me +1)”Go(yr) — neGo(xe) + $(n(nz + 1) — H(nk)), 
d’ou 
ne{Go(yx) — Go(ae)} — en, — 3{n(me +1) — n(re)} + O(ngr"). (16) 


Compte tenu de la définition de 7(n), il est clair que le second membre est 
de valeur absolue >> 1. Par ailleurs, il découle immdiatement de (15) que 


Nk -~ 4k _ (Yk = xR). (17) 


Il suit 
|Go(yx) — Go(ax)| >> (ye — te)”. 
Comme 
max{|a — a,|, |x — yx|} < 4-*, (18) 
cela contredit 
Go(z + h) — Go(z) = o(|h|”) (h + 0), 


et partant implique la conclusion requise (10) du Théoréme 1. 
La situation est encore plus simple pour le Théoréme 2. On obtient 
parallélement a (16) 


VrKe{Gi (yr) — Gi(te)} = Tn, + O(1/ Vink); (19) 


d’ot par (17), puisque |rp, | = 1, 
IGi (yn) — Gi(an)| > Vue — fk. 


Grace a (18), cela implique (12) et établit ainsi le Théoréme 2. 

La méme approche fonctionne encore pour établir le Théoréme 3. 
L’hypothése F' 4 O implique ur # 0, et, plus précisément, implique 
existence d’un entier a,1 < a < q‘, tel que ur(a) # 0. En effet, notant 
F*(h) := F(€0,.-.,€j-1) pour h= ieee Erg’, ona 


urG)= >> anF*(h) (l<j<d') 
0<h<q? 


avec ajn > 1 si j est de la forme 7 = a+ q*h avec s > 0,a < q’, et ajn = 0 
dans le cas contraire. En particulier, on a a;; = 1 pourl <j < q! et ajn = 0 
sih > j. La matrice carrée (a;;,) est donc triangulaire supérieure, avec des 1 
sur la diagonale principale. Par conséquent, elle est inversible et cela établit 
la propriété indiquée. 
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Cependant, lorsque £ > 2, on peut avoir ur(q’~'a) = 0 pour tout 
a > 1 sans que up soit identiquement nulle: pour ¢ = gq = 2 et F(1,0) = 
—F(0,1) =1, F(1,1) = 0, ur est la fonction indicatrice des nombres impairs. 
On peut vérifier facilement que, dans un tel cas, ur est périodique, Ar = 0, 
et la fonction Gp est constante. 

Soit alors a > 1. Nous allons montrer que si Gr est dérivable en un point 
x € [0,1[ alors up(q’-!a) = 0. On écrit les développements g-adiques 


a= > e(aq’, g= Sd e;/¢, 


0<r<m(a) j20 
et l’on pose 


L := 2+ m/(a), Np = grhtht* = gh S- esq’), ne tl = quater, 
O0<j<k 
(20) 


Ona 
ny <Q" = (Ye — ue); max{|x — xz|,|a — yrl} < q*. (21) 


Ici et dans la suite de cette démonstration les constantes implicites peuvent 
dépendre de a ou F' mais pas de k. 

En appliquant (14) avec n et n+ 1 et en faisant la différence, on obtient, 
lorsque n = 0 (mod q*~!) 


x 


ela Ae logn +n {Gr pe ~) ae (=e) (22) 


log q log q 


Sane) + 5p(1) ~ be(0) 


ou l’on a tenu compte de la périodicité de dr. Substituons n = nx dans cette 
relation. La périodicité de Gr nous permet de remplacer Gr (log(n+1)/ log q) 
par Gr(y,) et Gr(logn/logg) par Gr(arx), soit 


upr(n) = Arlogn+n{Gr(yx) — Gr(xe)} 
+Ap(n+1)log(1+1/n)+Gr(yx) + dr(1) — dr(0). (23) 
Si Gr est dérivable au point x, on a, lorsque k > co, 
Gr(xg) — Gr(x) = (ay — 2G‘ (x) + o(a@g — @) = (2p — 2)G'p(x) + O(1/nx), 
et similairement 


Gr(yk) — Gr(x) = (yx — £)G'p(x) + o(1/nx). 


+Apr(n+1)log(1+1/n) 4 Gr ( 


Il suit 
_ Gp(@) 


ne{Gr(yr) — Gr(xe)} = re(Yr — 2e)G'p(x) + o(1) = ee + o(1). 
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En reportant dans (23), on obtient 


up(ne) = Ap logny + Ar + a + G(x) + 5p(1) — dr (0) + 0(1), 
et donc 
up(n,) = kArplogqg+ B+ o(1), (24) 
avec B := Ap{1+ (x +L) log gq} + G(x)/log¢+ Gr(x) + br(1) — br (0). 
Substituons maintenant n = n;, + q’ 1a dans (22). Les calculs qui 
précédent restent valables mutatis mutandis, et l’on obtient 
ur(np +q° 1a) = kAplogg+B+0(1). (25) 
Or il découle immédiatement des définitions de ur et L que 
ur(ng + q' 1a) = ur(ne) + ur(gh 1a). 


Les relations (24) et (25) impliquent donc par différence 
ur(q ta) = o(1), cest-a-dire up(q’ ta) = 0. 


Cela termine la démonstration du Théoréme 3. 


3. Preuve de la formule de Cateland par 
intégration complexe 


Nous nous proposons ici de donner une démonstration de la formule (14) en 
utilisant la formule de Perron. La démarche, semblable a4 celle de Flajolet 
et al. dans [15], posséde le double avantage de ne nécessiter que quelques 
calculs assez simples et de fournir directement les développements de Fourier 
des fonctions Gr et dp. 
Au vu de (13), nous pouvons nous restreindre a estimer la valeur moyenne 
de o(n,€) pour ¢ = (€0,...,€¢-1) € E(q, @) fixé. Posons h := os e;q’ et 
Vin):= So olmse)= S> S\ xK(mje). (26) 
0<m<n k>00<m<n 


La somme intérieure est égale au nombre des entiers m € [0,n — 1] qui sont 
de la forme m = a+ q*h + q**+*b avec 0 < a < q*,b > 0. Elle vaut donc 


(+e [aMaseY)) = Be 


O<a<q* O<a<qk *% 


“LC (ole BeeaD ane [aD 


Pour établir la premiére égalité, nous avons utilisé le fait que l’intégrande du 
second membre est constant sur chaque intervalle |a, a + 1]. 
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Pour évaluer la partie entiere de la derniere intégrale, nous introduisons la 
fonction zéta de Hurwitz, définie, pour chaque valeur du paramétre a €]0, 1], 
par la formule 


C(s;@) = So(n +a)* (Res > 1), 
n2>0 


et prolongée en une fonction méromorphe dans le plan complexe tout entier 
ayant pour unique singularité un pole simple en s = 1, de résidu 1. On a pour 
z>0 


ctioo 
It|e-al=sof cisiaye  (e>1), 


271 Joico s 
sauf siz € a+Z, ot le membre de droite vaut |x — a] + $. On obtient donc 


ace ree n \* dsdt 
vin) = Soot f a ¢(s;t/q°) (42) . 


2—i00 


1 2+%400 n s 1 h+1 ; 
© Bri Io ico (3) a(Lo= gr") | cleibig:)-dtde 


1 ". (3) ve Na ; 


Qi q) s(1—s)(1—¢q!-s) * 


2—it00 


ot l’on a posé 


1/q 
Z(s;t) := C(sit-+ at) — clsit) =s f ¢(s +1;¢4+ u)du. 


Déplacons maintenant l’abscisse d’intégration vers la gauche jusqu’a l’axe 
Res = 5. La contribution du pole double en s = 1 vaut 


ae {Z(0;h/q') (log(n/q") — 1+ 5 logq) + 2'(0,h/q‘)}- 


La contribution des péles simples py, := 1 + 2rki/logq (k 4 0) est égale a 


4 Z(pe — 1; nit), (He) — ny, (128) aoa) 


en pe(1 — pr) log ¢ log q log q 


avec la notation traditionnelle e(t) := exp{27it}. La fonction gp est 
1-périodique, et sa série de Fourier, explicitée ci-dessus, est absolument 
convergente. En particulier, g, est continue. 


Ona 
Z0;h/q) =a, ¢'(0;a) = log (72 


ie , Z'(0;a) = log (Ma + q)/T(a)) 
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Il suit 
on T((h + 1)/q*) logn 
V(n) => qe log q {een 1 T log ae + NGh log q + dn(n), 
(27) 
avec 
1 prPton (n\* Z(s—1;h/¢°) 
nln) = i (=) s(1- ya — gas) 8) 


Nous évaluons 6;,(n) en faisant appel 4 l’équation fonctionnelle de la 
fonction zéta de Hurwitz, soit 


¢(s;a) =T(1— s) S- (2rmi)ste(ra) (Res <0), 
reZ\ {0} 
ot le logarithme complexe est pris en détermination principale. On en déduit, 
en posant a = h/q’, 
Z(s—1;a) =—s(1—s)P(-s) S> (2rmi)*-? e(ra)fe(r/q’) — 1}. 
rEZ~{0} 


Reportons dans (28) en développant 1/(q'~* — 1) = ay q~*G-s), Tl vient 


e 1/2+%00 
gd e(ra){e(r/q°) — 1 1 . kb) s 
6n(n) = 5 ( Melr/a) } 5 5 cf _ T(-s) (2rrnig* ‘) ds. 
reEZ~ {0} k>1 1/2-ico 


Par la formule de Mellin inverse 


1 1/2+i00 
=| I(-s)a*ds =e* — 1 (x > 0) 
1 


201 J1/2-ico 
(ot: le terme —1 provient du péle de I 4 Vorigine), on obtient 


5n(n) = > 3 ge(rh/q’ ey )- Dey shone 


An? 
rEZ\ {0} 1<k<e q 


Cela implique que 6;,(n) est bien une fonction g’~!-périodique de n et, compte 
tenu de (27), achéve ainsi la démonstration. 


Ajouté a la seconde édition: 

En 1903, Takagi [39] a exhibé un exemple de fonction continue non 
dérivable plus simple, mais essentiellement de méme nature, que celui, bien 
connu, de Weierstrass. La fonction de Takagi est définie par la formule 


(oS [2a (x ER), 


owt ||z|| désigne la distance du nombre réel z a l’ensemble des entiers. Cette 
fonction, qui a récemment suscité un intérét soutenu dans la littérature 
(voir notamment [19, 25]) est directement liée 4 la fonction périodique G 
apparaissant dans le terme résiduel de la formule (3) de Trollope—Delange. 
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To Paul Erdés, who held the torch. 


During the summer of 1975, I spent a few days with my mother and sister 
who were on holidays near La Baule. I had just left Ecole Polytechnique, 
and needed some rest after the military service. For 8 months I had been a 
sub-lieutenant in the 2°"°RAMA, a semi-disciplinary unit based in Vernon, 
Eure, and felt rather depressed after what had been for me a dreadful 
experience. For the time being, my main concern was the starting of my 
research in mathematics. I had regular “night-dreams”, and also daydreams, 
seeing myself “content-free” as a mathematician, working hard but having 
no ideas—and, of course, no results. 

Here I was on the beach, reading Hardy & Wright and trying to make 
a few notes on a pad in spite of sand and wind. I soon became fascinated 
by arithmetic functions (a subject completely new to me) and considered it 
hardly believable that the number r(n) of representations of an integer n as 
a sum of two squares had such nice and simple properties. On the one hand, 
I was trying to reconstruct the proof of the main formula, viz 


= 1+ (—1)” 
r(n) =4 II (v +1) II (A) 
p’||n, p=1 (mod 4) p’||n, p=3 (mod 4) 


along the lines of the algebraic number theory course I had taken at the 
University of Paris, writing here and there on the pad: revise Kummer’s 
Theorem. On the other hand, I was doing small experiments such as 
computing explicitly the first integer with a given value for r(n). The smallest 
n with r(n) = 32 is 


1105 = 5 x 13 x 17. 


There are four genuinely distinct ways of writing 1105 as a sum of two squares, 
namely 


1105 = 237 + 247 = 317 + 12? = 32? + 9? = 337 4 4?. 


Don’t you agree that this set of representations is rather odd? 

Today, nearly 20 years later (this is odd too, isn’t it?), I would probably 
consider the occurrence of three consecutive squares (317,327,337) as an 
epiphenomenon, and turn my attention to what I would regard as deeper 
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subjects. But, at that time, I had nothing else to bite on and so started right 
away to try to describe the set of those integers N which have two genuinely 
distinct representations 


N=2? +y? =(24+1)? +27. 


This turned out to be very easy, as the reader can imagine, and I soon found 
out that these N are exactly those which can be written as 


N= {a +(q+1)?}Hr? + (r +197} 
with suitable relative integers g,r not equal to 0 or —1. One then retrieves 
et=r+2qr+q, y=qtrt+l, z=aq-r. 


For q = r = 1, we obtain N = 25 = 47 + 3? = 524+ 0?, and N = 1105 
corresponds to q = 6,r = 2. 

Thus, 1105 was not a unique specimen and actually had infinitely many 
relatives! This was nice and called for generalization. The next, obvious 
step was to replace the condition that the squares should be consecutive by 
imposing that they should be squares of integers with prescribed difference 
n, in other words to search all N which can be written as 


N=2?4+y? =(a4+n)? +2? 


with {x,y} A +{x+n, z}. This was not much more difficult, and I ascertained 
that these integers are exactly described by the formula 


N={P+(q+n)P Hr? t+ (r+) 7}? 


where 2gr = 0 (mod n) and qr(q+n)(r + n)(2q + n)(2r + n) 4 0. I denoted 
by g(n) the smallest such N, and left for Bordeaux with my own arithmetical 
function in my pocket. 

At this stage the reader might wonder when Erdés is going to enter the 
scene. Such expectation, however, is perhaps significant: in one of Maurice 
Leblanc’s best novels, L’Eclat d’obus, one does await the hero Arsene Lupin 
for the major part of the book, but his presence is increasingly felt as 
ineluctable as the plot is developed—and the fact that one of the chapters is 
entitled 75 ou 155? is a purely formal coincidence for which number theorists 
should give no hasty interpretation. 

I arrived in Bordeaux in the early days of September, 1975, and went to 
see Francois Dress who, I was told, had worked on sums of squares. In fact , he 
was interested in Waring’s problem and had obtained new bounds for g(4)—I 
mean, of course, the g-function, usually used with argument & and defined 
as the smallest integer such that all numbers are sums of at most g(k) kth 
powers.' This perhaps explains, at least partly, why he wasn’t immediately 


' Balasubramanian, Deshouillers and Dress established in 1986 that g(4) = 19, 
thereby closing up (in a certain sense, which would take us too far to describe here) 
Waring’s classical problem. 
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crazy about my g-function. To tell the truth, he thought this was a load of 
piffle, but, not wanting to discourage an enthusiastic young man, he said that, 
in his opinion, the significance was e. This g-function wasn’t simply nor nicely 
defined, after all, and he believed that even writing a computer program to 
tabulate it up to n = 10° or 10!° might not be an easy task. A few days 
later, I came up with the following formula for g(n). Put t(7) = $(a? +1) 
and define, for integer n, the functions 
PN itt OO gS a 

(Thus pi(n) and p2(n) are the two divisors of n closest to \/n.) Then, denoting 
by p a prime number, we have 


g(1) = g(2) = 25,  g(p) = 5t(p) (p> 2), g(2p) = 10e(), 
g(n) = t(pi(n))t(p2(n)) (nm odd, not a prime number), 
g(2n) = 4t(p1(n))t(p2(m)) (m not a prime number). 


I was quite happy with this result and had already applied it to determine 
the limit points of the set {g(n)/n? : n € N}, as well as the asymptotic 
behaviour of the average (1/N) >7,,<y 9(n).-. Dress understood I was not 
going to give up easily: he erased the whole blackboard, except these new 
functions p1,p2, which “were defined in less than 14 characters”, so not 
unnatural; moreover, they seemed not to have been studied before... 

By this time, about a month had passed, and I had settled in Bordeaux, 
where the number theory group, created under the impulse of Pisot, was 
developing in a kind of semi-familial everyday life—with its well-known 
pleasant and not so pleasant implications. My ‘older brother’ (and future co- 
adviser with Dress) was Jean-Marc Deshouillers, with whom I had frequent 
discussions. He patiently taught me all basic notions in the field of arithmetic 
functions as well as recent directions of research, and I tried to apply these 
to my two newcomers. 

Everybody had already noticed that these questions where ‘Erddés-like’. 
After a while it became apparent that the hard problem was to evaluate the 
average order of p(n). Up to this date, this is still open. Dress had offered 
50 francs for a proof or disproof of 


gt S- pi(n) = o(V2). 


After about 2 weeks of struggle with bare hands, I opened Halberstam and 
Roth, Chap. V, and got in one night the bounds 


/z/loga « «~" > pi(n) < V2/log log x. 
n<xx 


This was something, but obviously insufficient, and I had no idea of what the 
next step should be. 
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Fortunately, a meeting on elementary and analytic number theory 
organized by Richert, Schwarz and Wirsing was scheduled in November, if 
my memory is right, at Oberwolfach. Deshouillers and Dress were invited and 
promised to ask Erd6és about p;. They came back with an amazing answer: 
Erdos conjectured that 


2 3 pi(n) = Va/(log 2), 
n<xx 
with, believe me or not, 6 = 1—(1+log log 2)/ log 2 = 0.08607. ..!! He added 
that he thought the method was similar to that of his ‘Russian paper’—the 
only paper Erdés ever published in the Russian language, although, as he 
told me later, he does not know any Russian. 

My friend Didier Nordon provided a translation and I started to study the 
paper, with Deshouillers’ help. This wasn’t an easy job. remember (and, I am 
sure, so does Jean-Marc) a five-hour train trip between Paris and Bordeaux 
during which we tried to understand the notation, sometimes rather obscure, 
and above all the ‘philosophy’ of the man who knew about numbers. 

Hardy and Ramanujan proved in 1917 that almost all integers n have 
about log log n prime factors, but it was Erdés who really understood all 
the possibilities opened up by this theorem. Indeed, Hardy and Ramanujan’s 
proof gives more than the so-called normal order of the function Q(n), equal to 
the total number of prime factors of n, counted with multiplicity. Essentially, 
the extra information is that the distribution of values of O(n) among integers 
n <x is roughly Poisson, with mean and variance log log x. The peak of the 
Poisson distribution is very narrow, and the tails are dominated, in first 
approximation, by single values. Erdés took advantage of this situation by 
clever splittings of the integers according to their number of prime factors 
(possibly in a given range), which shed light on otherwise rather intractable 
problems. 

Let us give an example with the following reasoning, typical of his 
approach. At the same time this will give an idea of the proof of the upper 
bound in the 6-conjecture—which I could establish, along the lines foreseen 
by Erdés, by the end of 1975. In Erdés’ setting (the one I used at the 
time), the computations would involve factorials, because of the Poisson 
probabilities, and the optimization would be rather cumbersome. However, 
the ‘parametric method’, introduced in this context by Richard Hall and 
which we subsequently developed together,” simplifies the technicalities a 
great deal. Suppose we want an upper estimate for the number S(x) of 
integers n < a with $,/z < pi(n) < Vx. We give ourselves a parameter 
A€ (1, 3) and split the integers up to x into two classes, according to whether 
Q(n) > Aloglog«x or not. The number of elements of the first class may be 
bounded, for any v € (1, 3), by 


? See our book Divisors for an expository text on this subject. 
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S- yon) —A log log « < x(log gyn Amer, 

n<xx 
The best choice is clearly v = A, so the exponent of log « becomes —Q(A), 
with 

Q(A) = AlogA—A+1>0. 

The above estimate follows from a classical result on non-negative arithmetic 
functions (see e.g. Halberstam and Richert, 1979) and we only mention that 
the version needed here may be proved elementarily in a few lines. As for the 


elements of the second class, the following expression is certainly an upper 
bound for any w € (0, 1] 


S- word log log x Ss 1 


n<X<a d\n,4./x<d<VJ/e 
= (log z)~* "sv S- we S- wm) 
§ Va<dK Ja m<a/d 
where, in the right-hand side, we have written n = md and permuted 


summations. Estimating the inner sum as previously, we obtain that this is 


< x(log gyv—1-A log w S- we /d K x(log gta reee 
bVa<d<Ve 


The optimal choice is now w = 5A, which does belong to (0,1], and the 
subsequent exponent of logx is —Q(A) + Alog2 — 1. Putting our estimates 
together, we deduce from standard calculus that we must select A = 1/ log 2. 
The absolute value of the exponent becomes Q(1/log2), which is equal to 
Erdés’ miraculous 6! 

Erdés came to Bordeaux in March 1977. At that time, he taught me about 
the law of iterated logarithm for prime factors (i.e. the jth prime factor of a 
normal number is roughly expexpj) but in the form of a basic consequence: 
the largest divisor of n all of whose prime factors do not exceed y is itself 
usually not much larger than y. The following, not unrelated device also 
came up in the conversation: if you want to find out whether an integer has 
a divisor in a given interval (y, z], you should essentially look at those prime 
factors of n in the range (z/y, z], Of course, these mottos weren’t formulated 
this way, and the light only came gradually. Nevertheless, I learned a great 
deal during this first visit, and I must say that all other meetings proved just 
as fruitful for me. It would be unfair not to mention that this unconventional 
teaching was delivered with extreme patience and kindness. 

Thus, Erdés, in the first instance, showed the way into the study of 
divisors by giving the basic devices; this is invaluable: everybody having 
worked in analytic number theory knows that the first step is very often 
the most difficult. But a second help, equally important, came from the 
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conjectures he made. These acted as a permanent measure of the depth of 
our methods and the quality of our knowledge. 

In the first rank of these, I would place the conjecture, made in the late 
30s, according to which almost all integers have at least two divisors d, d’ 
with d < d' < 2d. This was precisely the question to challenge the current 
model for the set of divisors of a normal integer. The point is important; let 
us be more explicit. According to the law of iterated logarithm the jth prime 
factor of n, say p;(n), satisfies log p;(n) ~ e’; hence, the divisors d should be 
described (assuming n squarefree for simplicity) by the formula 


logd = ye eje! 
1<j<Q(n) 


where the ¢; take the values 0 or 1. However, since e > 2, these quantities 
have increasing differences, and hence the smallest ratio between consecutive 
divisors should normally stay away from zero, and indeed should be large 
on a set of positive density. (This last statement is justified, for instance, by 
the fact that we can assume all prime factors to be large on a set of positive 
density.) Thus, the conjecture is really about the effect of the error terms 
in the law of the iterated logarithm. This explains simultaneously why it 
is interesting and why it is hard. This problem shares with most of Erdés’ 
questions the feature of being phrased in the simplest form which contains 
the whole substance of the matter. It was eventually solved in 1983 by Maier 
and myself, and the proof led to the following evaluation (another conjecture 
of E.P.) valid for almost all integers 
: / = 1—log 3+0(1) 
ae d'/d=1-4 (logn) ; 

As a matter of fact, it would be hard to imagine a way to solve the initial 
problem that wouldn’t give quantitative results of this kind: the question is 
so pertinent that only a deeper understanding of the structure of the set of 
divisors will yield an answer. This experience, and many others of a similar 
type, explain why I find it sad to hear, from time to time, Erdés’ problems 
qualified as anecdotal or scattered. This criticism comes from occasionally 
powerful, but always ignorant, people. 

Since Erdos’ first pioneering articles in the thirties, quite a few mathe- 
maticians have taken part in the strange quest which I entered through 1105. 
It would, of course, take us too far to give here a complete survey of the results 
that have been obtained, even during the last two decades. Nevertheless, I 
would like to mention another recollection. 

At the Durham Symposium on Analytic Number Theory, organized 
in 1979 by Halberstam and Hooley, a (gentle) mathematical controversy 
occurred between Erdés and Montgomery about the normal behaviour of 
the function 


G(n) = |{1 <j <7(n) : dj(n)|dj41(n)}I, 
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where d;(n) denotes the jth divisor of n. Erdés thought that G(n) = o(7(n)) 
for almost all n because this fitted with another conjecture of his, namely 
that, almost always, 


tt (n) = |{k € [1, logn/ log 2] : dd, dln, 2* < d < 2**"}| = o(r(n)). 


(Note that G(n) < 7t(n) for all n, since dj4i(n)/dj(n) > 2 whenever 
d;(n)|dj41(n).) The above bound for 7+(n) would have trivially implied 
the d,d’-conjecture mentioned earlier, and indeed it could be regarded as 
a kind of ultimate consequence of Erdés’ conception that the set of divisors 
could be described by big aggregates with very large gaps between them. 
Montgomery’s argument was probabilistic. Assume for simplicity that n is 
even, say n = 2m. Then at least one half of the divisors d;(n) divide m; 
for such a divisor, we have d;(n)|dj41(m) unless dj41(n) < 2dj(n) because 
2d;(n) is a divisor of n. However, the intervals (d;(n),2d;(n)] occupy, on 


a logarithmic scale, a set of measure < (log2)r(n) = (logn)!°?+°™) inside 
the interval (1,n], of measure log n. Therefore, it is conceivable that the 
probability that a divisor belongs to one of these intervals tends to 0. As 
soon as this probability is less than 4 —c, we have G(n) > cr(n). 

Thus, one can see that an apparently innocent question about a seemingly 
artificial function turns out to be critical to our understanding of the structure 
of the divisors. It was well-known that, on a slightly larger scale, the structure 
was indeed mainly in aggregates, so Erdés’ question was really: how strong is 
this tendency? Is it so strong that it can destroy probabilistic effects even 
in very short intervals of constant multiplicative length? As a matter of 
fact, it turned out in this case that Erdés’ conjecture was wrong-headed. 
In a joint work (1981), he and I proved that the density of those n with 
G(n) < er(n) tends to 0 with c.° It remains that the question was well-posed, 
and that the answer revealed precious information about the multiplicative 
structure of the integers, and particularly on the phenomena which involve 
the conflict between the ordering imposed, on a large scale, by the law of 
iterated logarithm and the local, random perturbations. 

The purpose of the present text is to give a comprehensible (if not 
comprehensive) account on Erdés’ extraordinary stamp on this part of 
Probabilistic Number Theory. The reader interested in further information 
may consult, besides the research bibliography, our book with R.R. Hall, 
Divisors. A synthetic approach, in the frame of fractal sets, which provides 
a complementary description of some models of the divisors is presented 
in a joint article with Michel Mendés France (1993): the fractal dimension 
of the set of divisors of a normal integer is log 2. The whole subject 
has been discovered, developed and guided by Erdés’ theorems, intuitions, 
and conjectures. Let these few lines be a token of admiration to the 
mathematician, and of gratitude to the man. 


3 But this density is strictly positive, which shows that the tendency on which 
Erdos based his conjecture is nevertheless quite constraining. 
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Added at the second edition: 

The reader will find an update of many topics discussed in this paper 
in my paper entitled Some of Erddés’ unconventional problems in number 
theory, thirty-four years later, to appear in the acts of the Erddés centennial 
conference, Budapest, July 2013. 
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III. Randomness and Applications 


Introduction 


It would be foolish to try describing Paul Erdés as an applied mathemati- 
cian, somebody who is looking outside of mathematics for motivation and 
justification of his activity. Yet we believe that the word application in 
the title is justified. An essential part of Erdés’ personality and success 
is his broad knowledge and a true feeling of unity of mathematics. This 
understanding brought him to many of his crucial discoveries and topics. 
Randomness is a pivotal example of this Erdés approach. With M. Kac he 
initiated this technique in number theory in 1939 (see the Erdés paper in 
this volume) and in graph theory he did so in 1946. This technique—the 
probabilistic or nonconstructive method—is by now one of the universally 
accepted modern combinatorial techniques and this is also reflected by papers 
in this section. The origins of the probabilistic method are described in the 
paper by Spencer while the random graph papers of Erddés and Rényi are 
described in the Karonski-Rucinski paper (with update). Applications of 
probabilistic methods have reached virtually all mathematical disciplines as 
well as many areas of theoretical computer science. The papers by Pyber, 
Pudlak and Sgall, and Razborov are such examples. This does not exhaust the 
papers relevant to probabilistic methods in this volume, see for example, the 
papers in the second volume by Bollobas and Fiiredi, Kahn, Laczkovich and 
Ruzsa, and the paper by Cameron (devoted to infinity). All of these papers 
are related to various aspects of Erdés’ work and belong to branches pioneered 
by him. By now this is well recognized and, there are, in fact, numerous books 
devoted to these areas, such as: B. Bollobés: Random Graphs, B. Bollobas: 
Extremal Graph Theory and also E. M. Palmer: Random graphs, P. Erdés, 
J. Spencer: Probabilistic methods in combinatorics and N. Alon, J. Spencer: 
The probabilistic method, and P. D. T. A. Elliot: Probabilistic number theory 
(to name just a few). 
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1. Tic-Tac-Toe-Like Games 


The object of this 50% survey and 50% “theorem-proof” paper is to 
demonstrate recent developments of some of the ideas initiated by Erdés 
[17, 18], Erdés and Selfridge [20], Erdés and Lovasz [19] and Erdés and 
Chvatal [15]. 

The story begins with a class of deterministic games of complete 
information. 

The most well-known chapter of combinatorial game theory deals with 
Nim-like games (a player unable to move loses). A beautiful theory was 
developed by Bouton, Sprague, Grundy, and recently by Berlekamp, Conway, 
Guy and others—and of course, Conway’s theory of “numbers and games” 
(see the remarkable book of B-C-G [13]). These ideas can be employed 
in games in which the positions are composed of several non-interacting 
simpler games. Then the first thing to do is to associate values (numbers or 
“generalized numbers” ) with these parts. Next comes the problem of finding 
ways of determining the outcome of a sum of games given information only 
about the values of the separate components. 

Here, however, we discuss a quite different branch of combinatorial game 
theory. This branch is in such an early stage of its developments that 
the experts didn’t even find a right name for the subject: it is frequently 
called as Tic-Tac-Toe-like Games, or Positional Games, or Pattern Games, 
or Achievement Games. We prefer the name “Tic-Tac-Toe-like games”. 
In contrast to Nim-like games which start out as composites, or develop 
into composites in the normal course of play, these games usually remain 
as single coherent entities throughout play. So the theory of Nim-like games 
(or Neumann’s “theory of games”) has little relevance to such “condensed” 
games. 

The best introduction to these games is Chap. 22 Lines and Squares of 
Winning Ways by Berlekamp, Conway and Guy. The traditional ideas in Tic- 
Tac-Toe-like games are the “strategy stealing argument”, “pairing strategies” 
and in general, the trick of “decomposition into non-interacting local games”. 
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Here we make a more systematic use of Ramsey Theory, and emphasize the 
importance of a relatively new idea what we call the “probabilistic intuition” . 
This leads to very effective global weight function strategies, and at the 
same time, it provides surprisingly accurate predictions to the outcomes 
of many complex games. We consider Tic-Tac-Toe-like games as a sort of 
bridge between the two well-established chapters of “Random Graphs and 
Hypergraphs” and “Ramsey Theory”. Also we include some applications in 
Complexity Theory. 

As a warm-up we start with two well-known board games which are won 
by the first player to complete some kind of winning pattern. 

Tic-Tac-Toe (or Noughts and Crosses): The game board is a big square 
which is partitioned into 9 = 3 x 3 smaller squares. Whoever moves first puts 
a cross in one of the nine small squares. His opponent then puts a nought into 
any other square and then they alternate nought and cross in the remaining 
empty squares until one player wins by getting three of his own squares in 
line. There are precisely eight winning lines. 

Hex: The game board is a rhombus of hexagons, the actual size of the 
board being a matter of agreement between the players. Each player takes a 
pair of opposite sides, his move is to take an untaken hexagon, and his object 
is to form a continuous chain between his two sides. Hex was invented by 
Piet Hein. 


Strategy Stealing Argument 


A winning strategy for a player is a list of instructions telling the player that 
if the opponent does this, then he does that so if he follows this strategy, 
he will win. In a tic-tac-toe type game, one can argue that the first player 
can achieve at least a draw. This follows by the well-known strategy stealing 
argument. 

Suppose otherwise the second player has a winning strategy, and the first 
player steals it. Then he can use this strategy to win the game. He randomly 
takes his first move, and then pretends himself as the second player. He reads 
the instruction to take action. If he is told to take a move that is still available, 
he takes it. If this move was taken by him before, then he randomly takes 
another move. The crucial point here is that an extra move only benefits him. 
In this way, he can guarantee a win. Therefore, the second player cannot have 
a winning strategy, which implies that the first player can guarantee at least 
a draw. 

Remember, however, when a draw is impossible, the first player wins. It is 
easy to see that in Hex a draw is impossible. Therefore, whoever plays first 
wins. We must warn the reader that this is a theoretical result, which assumes 
that the first player doesn’t make mistakes. If he makes a mistake, then the 
opponent can take advantage of the mistake, and might manage a win. 
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The reader possibly noticed that the two games above have some 
differences. In the tic-tac-toe game, both players can take any move, and 
they also share the same winning sets. In the Hex game, both can take any 
move, but they have different winning sets. 

We define the class of generalized tic-tac-toe games as follows. Let there 
be given a finite set X, which is the board of the game, and a family of 
winning subsets of X, say F = {A,, Ao,..., Am}, A; C X. The two players 
alternately take an element from X which is still available. That player who 
takes a complete winning set first is the winner. This family of games contains 
the tic-tac-toe (noughts and crosses) game as a particular case. 

Since an extra move in a generalized tic-tac-toe game does not harm a 
player, by the strategy stealing argument, whoever plays first has a drawing 
strategy. 


Theorem 1. Let X be finite, and F be an arbitrary family of subsets of X. 
Then the first player can force at least a draw in the generalized tic-tac-toe 
game on (X,F). 


However, the “complexity” of finding this (at least) drawing strategy 
is enormous. Indeed, a strategy for the first player is a function f with 
domain = “the set of subsequences of the board X” such that the “next 
move” f(#1,Y1,---,;®i—1, Yi-1) is always an element of X different from the 
“previously selected” elements 21, y1,..-,%i—1, yi-1 of X. In a play according 
to this strategy, the first player determines all his moves by f as follows: 
Suppose the players alternately picked the elements 21, y1,...,%i—1, yi-1 Of X 
in this order. Then the first player’s ith move is x; = f(@1, y1,.--,2i-1, Yi-1)- 

Let |X| = N. Then the total number of strategies is between 


22" and NN”, 


The amount of computation demanded by a computer to find an optimal 
strategy is a plain exponential function of the size of the board. Human brains 
can sometimes diagnose shortcuts, but we cannot expect substantial shortcuts 
in general. Even “small” board games are so complex that it is possible 
that the existing drawing (or winning) strategies may never be found. As a 
discouraging example, we mention that, Owen Patashnik, of Bell Lab, was 
the first to find a first-player winning strategy in the 3-dimensional 4 x 4 x 4 
tic-tac-toe (or as better called “tic-toc-tac-toe”). His 1977 program required 
1,500 h of computing time, and the winning strategy contains 2,929 strategic 
moves. 

We emphasize that the strategy stealing argument is highly nonconstruc- 
tive. It does not give much help in finding an explicit winning or drawing 
strategy for the first player. For example, no explicit winning strategy in Hex 
is known. 
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When Can the First Player Win? 


There is a rather general sufficient condition for the first player’s win. 
Consider a generalized tic-tac-toe game on (X,F). The chromatic number 
of F is the least integer r such that the elements of X can be colored with 
r colors yielding no monochromatic A € F. If the chromatic number of F is 
> 3, then draw is impossible, so by the strategy stealing argument, the first 
player has a winning strategy. 


Theorem 2. Suppose that the board X is finite, and the family F of winning 
sets has chromatic number at least 3. Then the first player has a winning 
strategy in the generalized tic-tac-toe game on (X,F). 


Note that it is a “hard” problem to decide whether a system (X,F) has 
chromatic numbers > 3. We have to check all the 2!*! two-colorings of X. 

There is a well-established chapter of Combinatorics, called Ramsey 
Theory, which is entirely devoted to families F of chromatic number > 3. 
We associate games with three famous results of Ramsey Theory. 


Ramsey Game 


If S is a set, then let [S]* denote the family of subsets of S containing exactly 
k > 2 elements. Following the set-theoretical traditions, we identify the 
natural number n with the set of its predecessors, that is, n = {0,1,...,2—1}. 
So [n]? can be regarded as a complete graph with n vertices, that is, [n]? = 
Ky. Let 2 <n < N. The board of the game is [N]? = Ky, and the two players 
alternately occupy edges of this graph (i.e., elements of [N]?) and that player 
wins who picks all the edges of a complete subgraph with n vertices (i.e., all 
the elements of [5]? for some n-element subset S C N = {0,1,...,N —1}) 
first. This game is denoted by R(N,n) = Ro(N,n). 

For k > 3, the game R,(N,n) is a trivial generalization. The players 
alternately occupy k-element subsets of N, and the winner is who picked all 
the elements of [S]* for some n-element subset S C N first. 


Van der Waerden Game 

The game W(N,n) is played on the board X = {0,1,...,N — 1}, and the 
winning sets are the arithmetic progressions of n terms from X. 
Hales-Jewett Game 


This is a straightforward multidimensional generalization of the game tic-tac- 
toe. The two players alternately put their marks in the cells of a d-dimensional 
cube of size n x n X --» x n =n. The winner is the first player to have n of 
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his marks in a line. More precisely, the board of the game H J(d,n) is the 
set of d-tuples 


X = {a= (a1, a9,...,aa):0< a; <n for each 1 <j <d}. 


The winning sets of H J(d,n) are those n-element sequences 
(a, a®, dines ai”) ) 


of the board X such that, for each j, the sequence a, rigs ee ae composed 


of the jth coordinates is either strictly increasing from 0 to n — 1, or strictly 
decreasing from n — 1 to 0, or constant. 

The well-known theorems of Ramsey, Van der Waerden, and Hales and 
Jewett state that for every integer n there is a finite threshold number 
rz,(n), w(n) and h(n), respectively, such that the family of winning sets in 
the Ramsey Game, Van der Waerden Game and Hales-Jewett Game has 
chromatic number > 3 if N = r;,(n), N = w(n) and d = h(n), respectively. 
Therefore, by Theorem 2, the first player has a winning strategy in these 
games. Unfortunately this theorem is rather weak, because either the best 
upper bound on the threshold number known at present is very poor, or the 
threshold number is in fact enormous. Consider e.g. the Van der Waerden 
threshold number w(n). The best upper bound on w(n), due to Shelah, 
is primitive recursive (Van der Waerden’s original argument couldn’t even 
provide a primitive recursive upper bound), but it is still enormous for 
“pedestrian mathematics”. For example, it is an open problem whether 
w(n) < exp,(n) holds, where exp, denotes the k-fold iteration of the 
exponential function exp(x) = e”. On the other hand, the best known lower 
bound is plain exponential: w(n) > 2”. The situation is exactly the same in 
the case of the Hales-Jewett theorem. 

In the case of Ramsey theorem, however, the upper and lower bounds are 
much closer to each other. It is known 


rl? < r9(n) <4", (1) 


gan 


2r'/6 < rs(n) < 27", (2) 
and in general, for arbitrary k > 8, 


eXPy_o(Ck*) <re(M) < exPg_1 (Ce); (3) 


where cy, and cj, are absolute constants depending on k only. 

It seems to be highly unlikely that the breaking points for the behavior 
of these Ramsey type games is anywhere close to the behavior of the 
corresponding Ramsey threshold functions, but no method is known for 
handling these problems. Note that even the existence of a breaking point 
is questionable. It may well happen that the first player wins e.g. the Hales- 
Jewett Game H J(d,n), but the H J(d+1,n) game is a draw. 
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Pairing Strategies 


By far the most common technique to guarantee a win or a draw is to find 
a decomposition of the board into disjoint pairs, and when your opponent 
takes one member from a pair, you take the other one. 

Every child “knows” that in the usual tic-tac-toe (i.e., the Hales-Jewett 
Game H J(2,3)) the second player can force a draw. A precise mathematical 
proof of this fact can be found in Chap. 22 of Berlekamp, Conway and Guy 
[13]. If the board is 4 x 4 and the object of the game is to find 4-in a-row (i.e., 
the Hales-Jewett Game H J(2,4)), then again the second player can force 
a draw. Exactly the same holds for all the Hales-Jewett Games of the type 
H J(2,n), n = 5,6,7,8,... (see B-C-G [13]). We just give a quick proof of 
the cases n = 5 and 6 by the following pairing strategies: whenever the first 
player occupies a numbered cell, the second player takes the other cell of the 
same number. 


2105 51 
691289 
n=5: 611 *« 114 
7101274 
13 3 82 


If the first player takes the +-labeled center, the second player may take any 
cell, and if the cell he is required to take by the pairing strategy is occupied, 
he may play anywhere. 


1 13 2 13 3 12 
6 14 5 14 4 12 
pink 7 8159 1015 
16 3 11 1 16 2 
17 4116175 
7 8 18 9 1018 


What is more, this last numbering leads to an elegant proof that the 
unrestricted 9-in-a-row (i.e., the board is an infinite chess board, and the 
object of the game is to find 9-in-a-row orthogonally or diagonally) is a draw. 
Cover the infinite board with copies of the 6 by 6 matrix above. The second 
player can force a draw by always taking the nearest cell with the same 
number as of the previous play. It is easy to see that the first player can 
obtain no line longer than 8. 

unrestricted 8-in-a-row: It is known that the second player has an explicit 
drawing strategy. However this strategy is not a pairing strategy. To illustrate 
the idea, first we outline a second proof of the fact that the unrestricted 
9-in-a-row is a second player’s draw. 
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Decomposition into Non-interacting Games 


It was first proved by Pollak and Shannon in 1954 that for 9-in-a-row the 
second player can force a draw, by using the following strategy. Tile the plane 
with H-shaped heptominos (seven squares): the second player plays ordinary 
tic-tac-toe in each of these 7-square regions, concentrating on preventing a 
line of 3 in either a diagonal, or the horizontal or the right vertical. 

Using the same idea T.G.L. Zetters (nom de guerre of some Amsterdam 
combinatorists) recently showed that the second player can even draw 8-in- 
a-row. Their proof uses a parallelogram-shaped tile of 12 cells, and goes some 
way towards showing that 7-in-a-row is also a draw. 

Next we give a general sufficient condition for the existence of pairing 
strategies. 


Theorem 3. Consider a generalized tic-tac-toe game on a finite system 
(X,F). Assume that for any subfamily G C F, 


U Al > 2\G]. 


AEG 
Then the second player can force a draw by a pairing strategy. 


Proof. By the well-known Konig-Hall theorem, we can find disjoint 2-element 
representatives: h(A) C A (A € F), |h(A)| = 2, h(A) N A(B) = O whenever 
A and B are different elements of F. (A technical twist: we in fact apply 
the Konig-Hall theorem to the “duplicate” of F, i.e., every A € F is taken 
in two copies.) Now whenever the first player occupies one member from a 
2-element representative, then the second player takes the other one. 


Note that there are well-known efficient (i.e. polynomial time) algorithms 
to actually find the disjoint 2-element representatives. 


Corollary 1. Let F be an n-uniform system, t.e., |A| =n for every A EF. 
Further assume that every x € X is contained by at most n/2 members of F. 
Then the second player can force a draw by a pairing strategy. 


Proof. Observe that the criterion of Theorem 3 applies here. Indeed, for any 
subfamily G C F, by a standard double-counting argument, 


agl=Siil= YD YS is U4} 5. 


AEG reEU ycg AAEG EA AEG 


and dividing by n/2, 


U Al > 2\G]. 


AEG 
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Note that Theorem 3 is very general. It doesn’t restrict the global size 
of ¥ at all, and in fact it holds for infinite boards as well. In spite of this, 
Theorem 3 is not very useful in our “condensed” games. Consider e.g. the 
three Ramsey type games Ro(N,n), W(N,n) and H J(d,n). We leave to 
the reader to check that in the cases of Ramsey Game R2(N,n) and Van 
der Waerden Game, Theorem 3 gives a ridiculously weak bound on N (like if 
N < const-n then the game is a draw). For the Hales-Jewett Game H J(d,n), 
in their fundamental paper [21], Hales and Jewett proved, by using Theorem 3 
that if 


n>3%—1 (nodd) and n>2**!~—2 (n even) (4) 


then the game is a draw. They conjectured that the game is always a draw if 
there are at least twice as many cells as lines (which is a necessary condition 
for any pairing strategy). How many lines are there? It is easy to see that 
each winning line in H J(d,n) is determined by the two cells which extend 
the line into the surrounding cube 


(n +2) x (n+2)x +++ x (n +2) =(n42)%. 
So the total number of lines is 


(n +2)? —n# 
= ) 


Therefore, the Hales-Jewett conjecture is that the game is a draw whenever 
n? > (n4+2)*=n7, (6) 


that is, 


Since 


(6) is equivalent to 


(compare it to the weak bounds in (4)). 
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2. Weight Function Strategies: A Fake Probabilistic 
Method 


In 1973 a really useful sufficient condition for second-player’s draw was found 
by Erdés and Selfridge [20]. Opposite to the pairing strategy and its variants, 
where we decompose the position into several non-interacting small local 
games, the Erdds-Selfridge approach is global. 


Theorem 4. I[f 


Serie 


ACF 


then the second player can force a draw in the generalized tic-tac-toe game 
on (X,F). 


Corollary 2. If F is n-uniform and |F| < 2"~!, then the game is a draw. 


The proof of Theorem 4 is based on an exponential weight function 
technique. It is often called as “derandomization of the first moment method”: 
indeed, the criterion in Theorem 4 ensures that in a standard random 2- 
coloring of X the expected number of monochromatic members A € F is less 
than 1. (If the expected number of monochromatic sets is less than one, then 
of course there exists a “good” 2-coloring in the sense that no monochromatic 
set shows up (see Erdés [18]). This gives at least a chance for a drawing 
strategy. The message of Theorem 4 is that under the global condition above 
this drawing strategy really exists. Note, however, that the usual 3 x 3 tic- 
tac-toe gives a family of 8 triplets, for which “good” 2-coloring exists, but 
the second player cannot prevent the first one from completing a winning 
triplet.) 


Proof of Theorem 4. We in fact prove a stronger statement: under the 
condition of Theorem 4, the second player can prevent his opponent from 
completing any winning set A € ¥. According this, we call the first and 
second players “Maker” and “Breaker” , respectively. 

Given any family G of subsets of X we assign the total weight 


7(9) = Soo 
AGG 
to G. Consider now a play on our family F in which the points 
V1, W1, V2, W2,...E X 


were picked by the two players in this order. After Maker’s ith move, define 
the “truncated family” F; for i > 1 as follows. Throw away those sets from 
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F which contain any point picked by Breaker (“dead sets”), and from the 
remaining sets (“survivors”) throw away the points picked by Maker, i.e., 


F;, ={A\ {u1,...,u}:A4€F and An {wi,...,wj_-1} = 0}. 


Maker wins if and only if some of the F;’s contain the empty set, and since 
the cardinality of the empty set is zero, in this case T(F;) > 2-° = 1. Thus 
if T(F;) < 1 for every 1 > 1, then Breaker wins. 

We define a strategy for Breaker. Let the weight of a set A € F; be 27/41, 
and the weight of a point of F; be the sum of the weights of the sets in F; it 
belongs to. In his ith move Breaker picks that point of F; which is of largest 
weight. We claim that 


DF) Ps) 


independently of Maker’s (¢ + 1)st move. If we prove this the result follows 
since every set of F, contains at most one point of Maker, so by hypothesis 
T(F,) < 2-5 =1. Therefore, T(F;) < 1 for every i > 1. 

We check T(Fj41) < T(F;). Right after Breaker’s ith move (that is, before 
Maker’s (¢+1)st move), the sum of the weights of the sets is T(F;)—W where 
W is the weight of the 7th point of Breaker. On Maker’s next move he doubles 
the weight of each “surviving” set containing his (i + 1)st point v;41, so he 
adds to T(F;)—W no more than the previous weight W’ of v;41. But Breaker’s 
ith move was a point of largest weight, so W > W’. It follows that 


T(Fis1) < T(Fi;) -W+W' <T(F) 


which was to be proved. 


Note that the strategy above has a probabilistic interpretation: it is 
a greedy algorithm to minimize certain conditional probabilities—see e.g. 
Chap.15 in Alon-Spencer [1]. This weight function method is our most 
important tool to handle Tic-Tac-Toe-like Games. 

Note that Corollary 2 is sharp: the full branches of a binary tree with 
n levels form an n-uniform family of 2”~! winning sets such that the first 
player has an easy win in n steps. 

In opposition to Theorem 3, Corollary 2 gives quite good results in 
Ramsey type games. In the Ramsey Game Ro(N,n), the criterion holds if 


(*) <2(3)-2, 


R2(N,n) is a draw if N < 2"/?, (8) 


so the game 


In the other direction, by Theorem 2 and (2), 
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the first player can win the R2(N,n) game if N > 4”. (9) 


These are the best results known at present. 

Next consider the Van der Waerden Game W(N,n). Since the number 
of n-term arithmetic progressions in {0,1,..., N — 1} is less than N?/n, the 
criterion applies if N?/n < 2”~1. This holds if 


N<Vn-2°3, (10) 


so the game W(N,7) is a draw if (10) is satisfied. 
By employing some special properties of arithmetic progressions, we can 
improve on this bound (see [4]): if 


N < (2—«e)”" and n> no(e), (11) 


then the W(N,n) game is a draw. This is the best known estimation. 
Finally consider the Hales-Jewett Game H J(d,n). The Erdés-Selfridge 
criterion holds if 


(n + a7 _ n4 < gn-l 
2) ? 
that is, the second player can force a draw whenever n > const-d-logd. This 
result just falls short of conjecture (7) of Hales and Jewett that the game is 
a draw if n ~ 2d/log2. The family of winning sets in H J(d,n), however, 
has an important additional feature: any two winning lines have at most one 
point in common. Set-systems with this property are called almost disjoint. 
For almost disjoint set-systems the upper bound 2”~! of Corollary 2 can be 
raised considerably (see [3]). 


Theorem 5. There is an absolute constant c > 0 such that for every n- 
uniform almost disjoint family F of winning sets, if 


[Fl ae 


then the second player can force a draw in the generalized tic-tac-toe game 
on (X,F). 


Note that this theorem is also sharp as far as the order of magnitude is 
concerned. Erdés constructed a 3-chromatic n-uniform almost-disjoint set- 
system F* with no more than n*- 4” n-sets (see [19]). In view of Theorem 4, 
the generalized tic-tac-toe game played on this F* is a first-player win. 

Though the family of n-term arithmetic progressions in the Van der Waer- 
den game W(N,n) is not almost disjoint, the situation strongly resembles 
to almost disjointness. This is why we can employ the proof-technique of 
Theorem 5 to the game W(N,n), and can jump in (10)—(11) from 2”/? up to 
(2—e)”. 
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To settle conjecture (7) about the Hales-Jewett Game H J(d,n) (at least 
for sufficiently large numbers), we just need a slight refinement of Theorem 5. 
The really important property is the restriction 2"~°v” of the local size, the 
global size of the family of winning sets can be as large as (say) gn? and 
the game is still a draw (see [3]). 


Theorem 6. Let F be an n-uniform family of almost disjoint subsets of a 
finite X. Assume that every x € X is contained in at most 2°—°V” members 
of F, and |F| < ar"? Then the second player has a winning strategy in the 
generalized tic-tac-toe game on (X,F). 


This result can be interpreted as a sort of “game-theoretic local lemma”, 
at least for almost disjoint systems. 

The proofs of Theorems 5-6 are based on the same idea. We split the 
game into two non-interacting parts, what we call the “BIG GAME” (a 
shrinking game) and the “small game” (a growing game). The “BIG SETS” 
are the unions of some “connected” subfamilies of F. The “small sets” are the 
available parts of those “dangerous” members of F, which are almost entirely 
occupied by the first player and are not blocked by the second player yet. 
If the family of “small sets” is “sparse”, then the second player can employ 
Theorem 3 to block them. The “BIG GAME” plays an auxiliary role here: by 
using the strategy of Theorem 4, the second player can force that the “small 
game” is always “sparse”, so that pairing strategy really works. 

It is easy to see that in the Hales-Jewett Game H J(d,n), every cell of 
the board n x --- x n = n@ is contained by at most (34 — 1)/2 winning lines. 
So Theorem 6 applies here if 


d 1 2 d_»d 
SD gph oh ang OFA) cont (12) 
Inequalities (12) trivially hold if 
log 3 
n> (PES te) d and n > nol). (13) 


Observe that (13) is asymptotically better than Hales-Jewett conjecture (7). 
That is, the second player can force a draw even when pairing strategy simply 
cannot exist. 


3. Maker-Breaker Version 
As we have seen in Sect.1, the second player has no chance to win a 


generalized tic-tac-toe game against a perfect first player. Then, why does 
he not just concentrate on preventing his opponent from a win? We thus can 
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name him the Breaker, and the other one the Maker. We can even take this 
one step further, and consider the Maker-Breaker game even if the Breaker 
moves first. 

On the same system (X, 7) we can play the symmetric generalized tic- 
tac-toe game, and also the asymmetric Maker-Breaker version, where Maker’s 
aim is to pick every element of a winning set A € ¥, and Breaker’s aim is to 
prevent Maker from doing so. The winner is the one who achieves his goal 
(so a draw is impossible). 

If Breaker = second-player wins the Maker-Breaker version on a system 
(X, F), then the same play gives him, as a second player, (at least) a draw 
in the tic-tac-toe version on (X,F). So from Theorem 2 immediately follows 


Corollary 3. Assume that X is finite, and the family F of winning sets 
has chromatic number at least 3. Then Maker = first-player has a winning 
strategy in the Maker-Breaker version on (X,F). 


The converse is not true. It is possible that Maker =first-player wins 
the Maker-Breaker version while Breaker = second-player can force a draw 
in the tic-tac-toe version. This happens for example in the usual 3 x 3 tic- 
tac-toe: the original game is a draw but the Maker-Breaker version is a win 
for Maker = first-player. It is true, however, that the same criterions as in 
Theorems 4-6 are sufficient for Breaker = second-player to win the Maker- 
Breaker version (and so they are automatically sufficient for Breaker = first- 
player to win as well). 

The proof of Theorem 4 actually gives the stronger 


Theorem 7. Assume that X is finite, and 


D2 hs . 


ACF 


Then Breaker = second-player has a winning strategy in the Maker-Breaker 
version on (X,F). 


Similarly, we have (see [3]) 


Theorem 8. Assume that F is an n-uniform almost disjoint family of 
subsets of a finite X. Suppose further that every x € X is contained in at 
most 2"-°V" members of F, and |F| < ar"? Then Maker = second-player 
has a winning strategy in the Maker-Breaker version on (X,F). 


While playing the tic-tac-toe version on an (X,F), both players have 
their own threats, and either of them, fending off the other’s, may build his 
own winning set. Therefore, a play is a delicate balancing between threats 
and counter-threats and can be of very intricate structure even if the system 
(X, F) of the game is simple. 
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The Maker-Breaker version is usually simpler. Maker doesn’t have to 
waste valuable moves fending off his opponent’s threats. He can simply 
concentrate on his own goal. This is why we have a surprisingly simple 
sufficient condition for Maker’s win (see [4]), which is usually far better 
than Corollary 3 (see Theorem 9 below). Note however that, though the 
Maker-Breaker version is usually simpler, it is still “very hard”. For example, 
Hex is equivalent to a Maker-Breaker version: Maker = first-player wants a 
connecting chain, and Breaker = second-player simply wants to prevent his 
opponent from achieving his goal. 


Theorem 9. Assume that F is n-uniform, and any point in X is contained 
by less than half of the members of F. Suppose moreover that, fixing any two 
elements of the board X, no more than d winning sets from F contain both 
of them. If 


IF] 22"? -d- |X], 


then Maker has a winning strategy in the Maker-Breaker version on (X,F), 
and it doesn’t matter at all that Marker is the first or the second player. 


Proof. We consider the worse case where Maker is the second player. Here 
Maker “borrows” Breaker’s weight function strategy from the proof of 
Theorem 4. 

Given any family G of subsets of X we assign the total weight 


T(G) = S02 


AGG 


to G. Consider now a play on our family F in which the points 
U1, W1, V2, W2,--. E xX 


were picked by the two players in this order. After Breaker’s ith move, define 
the “truncated family” F; for i > 1 as follows. Throw away those sets from 
F which contain any point picked by Breaker (“dead sets”), and from the 
remaining sets (“survivors”) throw away the points picked by Maker, i.e., 


F;, ={A\ {wi,...,wi-1}:A€F and An{u,..., vi} = Of. 


Maker wins if the last total sum T(Fenqa) is still positive, that is, if at the 
end of the game there is a “survivor”. 

We define the winning strategy for Maker=second-player as follows. 
Let the weight of a set A € F; be 27!4!, and the weight of a point of F; 
be the sum of the weights of the sets in F; it belongs to. In his ith move 
Maker picks that point of F; which is of largest weight. We claim that 


T(Fiti) = T (Fi) — 


BIL Q 
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independently of Breaker’s (i + 1)st move. If we prove this, the result follows 
since every set of F, contains at most one point of Breaker, namely v,, so by 
hypothesis T(¥,) > |F|/2-2~”. Therefore, 


TF pa) > Fi ae 8, 


and Maker wins. 

We check T(F;41) > T(F;) — ¢. Right after Maker’s ith move (that is, 
before Breaker’s (+ 1)st move), he doubles the weight of each “surviving” set 
containing his ith point w;, so he adds to T(F;) the weight W of his ith point 
w;. On Breaker’s next move he subtracts the new weight of each “surviving” 
set containing his i + 1st point v;41, so he subtracts from T(F;) + W the 
previous weight W’ of vi+1, and also the previous weight of those “surviving” 
sets which contain both w; and vj+1: 


whe SD Wl, 


BEF; 
{wi,vigi} CB 


By hypothesis, W” < d/4. Since Maker’s ith move was a point of largest 
weight, so W > W’. It follows that 


Tha) AR) WW -W" err) 


>) 


BIL Q 


which was to be proved. 


If F is almost disjoint (that is, any two different winning sets have at 
most one common element), then fixing two elements of the board X, no 
more than one winning set can contain both of them. So we have 


Corollary 4. If F is an n-uniform almost disjoint system such that any 
point of X is contained in less than half of the members of F, and |F| > 
2"-2|X|, then Maker has a winning strategy in the Maker-Breaker version 
on (X,F), and it doesn’t matter that Maker is the first or the second player. 


To appreciate Theorem 9, we study the Maker-Breaker versions of the 
Ramsey type games R;y(N,n), W(N,n) and H J(d,n). Note that statement 
(iv) below is a straightforward consequence of Theorem 9 (see [4]). 


Theorem 10. Consider first the Maker-Breaker version of the Ramsey 
Game R2(N,n): 


(i) If N < 2"/? then Breaker; 
(ii) If N > (2+.e)” then Maker 
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has a winning strategy. For every k > 3 there are positive constants cy and 
c, such that, in the Maker-Breaker version of R;,(N,n): 


(ii) If N< acer" then Breaker; 
(iv) If N> 2°" then Maker 


has a winning strategy. 


Comparing Theorems 2 and 10 we see that the breaking point for the 
Maker-Breaker version is within much-much-much closer bounds than that 
of the tic-tac-toe version. For a general k, the former one is between (roughly) 
ar" and 2”, and the latter one is between 2” and the tower 


exp (exp (---exp(n)---)) 


of height k. Moreover Theorem 10(iv) in fact proves that, in the Maker- 
Breaker version of the Ramsey Game R,(N,n), Corollary 3 (= Ramsey 
theory) fails to give the true order of magnitude of the breaking point. Indeed, 
the Ramsey threshold number r;(n) is known to be bigger than the tower 


exp (exp (---exp(m) --- )) (14) 


of height & — 1 (see (3)), but Maker = first-player can win around N = gn 
which is asymptotically much smaller than (14) if k > 4. 

The difference between the Maker-Breaker version and the tic-tac-toe 
version of the Van der Waerden Game is even more dramatic (see [4]). 


Theorem 11. Let ¢ > 0 be arbitrary. If N is large enough depending only 
on €, then in the Maker-Breaker version of the Van der Waerden Game 
W(N,n): 


(i) If N < (2—e)” then Breaker; 
(ii) If N > 3-2” then Maker 


has a winning strategy. 


By Theorems 2 and 11, the breaking point in the Maker-Breaker version 
is around 2”, but in the tic-tac-toe version we cannot even prove the upper 
bound 


exp (exp (---exp(n)---)), 


which is a tower of height n. 

The situation is very similar in the Hales-Jewett Game: the best known 
upper bounds in the two versions are an astronomical distance from each 
other. 
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Theorem 12. Let ¢ > 0 be arbitrary. In the Maker-Breaker version of the 
Hales-Jewett Game H J(d,n): 


(i) Ifn< aos then Maker; 


(ii) Ifn> (25 + e)d then Breaker 


has a winning strategy (if n is large enough). 


Finally, we return to Hex. Nobody knows an explicit winning strategy 
for Hex, but there is a large class of very similar games for which explicit 
winning strategies were found. 


Lehman’s Theory of “the Shannon Switching Game” 


The Shannon Switching Game is played on a graph representing an electrical 
network in which certain nodes are labelled + and some others are labelled —. 
Each edge (begin the game with them drawn in pencil) represents a 
permissible connexion between the nodes at its ends. Maker, at his move may 
establish one of these connexions permanently (ink over a penciled edge) and 
attempts to form a chain between some + node and a — one. His opponent, 
Breaker may permanently prevent a possible connexion (erase a penciled 
edge) and tries to separate + from — forever. You can always suppose that 
there is only one positive node and one negative one by making identifications. 

Supposing this, Alfred Lehman has proved that Maker can win as second 
player if and only if he can find two edge-disjoint trees which each contain all 
the nodes of some subgraph containing + and —. The “only if” part is hard, 
but there is an easy explicit strategy which proves “if”: whenever Breaker’s 
move separates one of the trees into two parts, say A and B, Maker makes a 
move on the other tree joining a vertex of A to one of B. 

The game can be generalized to make the winning configurations for 
Maker just those which contain a specified family P of sets of edges. (In the 
original game P was the family of all paths from+to—.) Lehman proves 
the “only if” part of this theorem by taking P to be the family of all trees 
containing every vertex (spanning trees). 

If Maker, as second player, has a win in the modified game, then by the 
strategy stealing argument there must be two edge-disjoint spanning trees. 
For since an extra move is no disadvantage, both players can play Maker’s 
strategy. If they do this, two spanning trees will be established, using disjoint 
sets of edges. Conversely, if two such trees exist, our previous strategy for 
Maker actually wins for him as second player, even in the modified game. 

The more detailed part of Lehman’s argument establishes that, in a 
suitable sense, the modified game reduces to the original one. 
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4. Complex Graph Games and Random Graphs 


When a 1-1 game is overwhelmingly in favor of one of the players one can 
make up for this handicap by allowing the other player to claim many points 
in amove. For example, when played on a large complete graph K,,, Lehman’s 
game (= Maker wants a spanning tree) is overwhelmingly in favor of Maker. 
So we allow Breaker to claim many edges per move: let each move of Breaker 
consist of claiming b previously unclaimed edges. Clearly if b is large with 
respect to n then Breaker wins, if b is small with respect to n then Maker 
has a win. In the biased case Lehman’s criterion above hopelessly breaks 
down. However, the following heuristic argument, due to P. Erdés, suggests 
the game-theoretic threshold point has to come around n/ log n. 

During a play Maker takes = a edges. In particular, if 6 = Taeee then 
Maker creates a graph with ~ c-n-logn edges. A well-known theorem from 
the theory of Random Graphs states that a random graph with n points and 
c:-n-logn edges is “almost certainly” connected for c > 1/2 and “almost 
certainly” disconnected for c < 1/2. It turns out that the game theoretic 
breaking point does indeed come around b = n/logn; more precisely, it is 
between (log 2 —¢)n/logn and (1+ ¢)n/logn for all sufficiently large values 
of n. 

Let us return to the random graph with n points and c-n- logn 
edges. Around c = 1/2 the random graph undergoes a remarkable change: 
If c < 1/2 then it has plenty of isolated points; if c > 1/2 then it contains 
a Hamiltonian cycle (which trivially implies connectivity). The fundamental 
difference between connectivity and the existence of Hamiltonian cycles is 
that the former property can be efficiently (i.e., the “worst case” running time 
is a polynomial function of the input data) proved if it holds and efficiently 
disproved if it does not, but nobody knows an efficient way of disproving 
that a graph contains Hamiltonian cycles. We can overcome this technical 
difficulty, and are able to show that the “probabilistic intuition” works: the 
breaking point for the Hamiltonian Cycle Game comes around b = n/logn. 
More precisely, if b = (982 _ 6) then Maker can build up a Hamiltonian 
cycle of his own. 

In general, we shall examine the following class of graph games. Two 
players, Breaker and Maker, with Breaker going first, play on the complete 
graph K,, of n vertices in such a way that Breaker claims b(> 1) previously 
unselected edges per move and Maker claims one previously unselected edge 
per move. Maker wins if he claims all the edges of some graph from a family of 
prescribed subgraphs of K,,. Otherwise Breaker wins, that is, Breaker simply 
wants to prevent Maker from doing his job. 

Let Clique(n;b,1;7r) denote the game where Maker wants a complete 
subgraph of r vertices (from his own edges of course). Denote by Connect 
(n;b,1) and Hamilt(n;b,1) the games where Maker’s goal is to select a 
spanning tree (i.e. a connected subgraph of K,,) and a Hamiltonian cycle of 
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Ky, respectively. It is well-known that the largest clique in a random graph of 
n vertices and of parameter p = 1/2 has approximately 2 log n/ log 2 vertices 
with probability tending to one as n tends to infinity. In view of Erdés’ 
heuristic, this suggests that Maker wins the fair game Clique(n;1,1;r) if r is 
around logn. The following result is just a reformulation of Theorem 10(i)— 
(ii) (see Erdés-Selfridge [20] and Beck [4]). 


Theorem 13. Breaker has a winning strategy in the fair game 
Clique(n; 1, 1; 2 log n/ log 2). 
On the other hand, Maker has a winning strategy in 
Clique(n; 1,1; (1 — €) log n/ log 2) 
if n is large enough depending on € > 0 only. 
The next result is due to Erdés-Chvatal [15] and Beck [5, 7]. 
Theorem 14. We have: 
(i) Ifb > (1+e)n/logn 
then Breaker has a winning strategy in Connect(n;b,1) ifn is large enough. 
(ii) If b > (log2 — €)n/logn 
then Maker has a winning strategy in Connect(n; b,1) ifn is large enough. 


(itt) If b > (log 2/27 — €)n/logn 


then Maker has a winning strategy in Hamilt(n;b,1) if n is large enough. 


Next we point out further instances of this exciting analogy between the 
evolution of random graphs and biased graph games (see [10]). The basic 
idea is that Maker’s graph possesses some fundamental properties of random 
graphs (mostly “expandability” type properties) provided Maker uses his best 
possible strategy. 

Let us begin with a trivial observation: if b = 2n then Breaker can easily 
prevent Maker even from getting a path of two edges (Breaker blocks the 
two endpoints of Maker’s edge). If b = en, « > 0 constant, then, in view 
of Theorem 14(i), Breaker can force the disconnectivity of Maker’s graph. 
In fact, Breaker can force at least Ee-en isolated points in Breaker’s graph, 
and it goes as follows (the following argument is a straightforward adaptation 
of the Erdés-Chvatal proof of Theorem 14(i)). Breaker proceeds in two stages. 
In the first stage, he will claim all the edges of some clique K*, with m = en/2 
vertices such that none of the Maker’s edges has an endpoint in this K;>,. 
In the second stage, he will claim all the remaining edges incident with at 
least fe-=en vertices of kK}, thereby forcing at least Ee-en isolated points in 
Maker’s graph. 
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The first stage lasts no more than m = en/2 moves, and goes by a simple 
induction by m. During his first i—1 (1 < 7 < m) moves, Breaker has created 
a clique Kj_, with 1 — 1 vertices such that none of the Maker’s edges has an 
endpoint in A_,. At this moment there are i—1 < en/2 Maker’s edges, hence 
there are at least two vertices u, v in the complement of V(A}_,) which are 
incident with none of the Maker’s edges. On his ith move, Breaker claims 
edge {u,v} and all the edges joining wu and v to the vertices of K7_,, thereby 
enlarging K;*_, by two vertices. Then Maker can kill one vertex from this 
clique Kj, by claiming an edge incident with that vertex. Nevertheless, a 
clique of 2 vertices still “survives”. 

In the second stage, Breaker has m = en/2 pairwise disjoint edge-sets: 
for every u € V(K;.), the edges joining wu to all vertices in the complement of 
V (kK; ). It is easy to see that Breaker can completely occupy at least e-=mof 
these m disjoint edge-sets by the simple rule that he has the same (or almost 
the same) number of edges from all the “surviving” edge-sets at any time. 

The first result says that Maker is able to build up a cycle of length 
>(1- 7 RE )n. That is, if Breaker claims en edges per move, then Maker 
has an “almost Hamiltonian cycle” in the sense that the complement is 
“exponentially small” (the constant factor of 200 in the exponent is of course 
very far from the best possible—here we don’t make any effort to find the 
optimal, or at least nearly optimal, constants). 


Theorem 15. If 0 <« < 1/200 and Breaker selects en edges per move then 
Maker can build up a cycle of length > (1 — e7 200 )n on a board Ky. 


Note that if Maker just wants a path P,, of m = (1 — const,/e)n edges, 
then he can do it in the shortest way: in m moves. Indeed, Maker can 
employ the following simple greedy strategy: he keeps extending his path by 
adding that available point (as a new endpoint) which has minimum degree in 
Breaker’s graph. Trivial calculation shows that this greedy procedure doesn’t 
terminate in m = (1 — const - \/é)n moves. 

Observe that if ¢ = then Theorem 15 gives the order of magnitude 
of Theorem 14 (iii). 

Next question: what can we say about Maker’s largest degree (i.e. Maker’s 
largest star)? Obviously Breaker can prevent Maker from having a degree 
larger than 2N/b on a board Ky: if Maker selects an edge {u, v} then Breaker 
occupies b/2 edges from u and b/2 edges from v. 

For simplicity restrict ourselves to the fair case (i.e. b = 1). As far as I 
know this problem is due to Erdés (oral communication). Ldszlé Székely [27] 
proved (by using Lemma 3 in Beck [4]) that Breaker can prevent a star of 
size 5 + const: /nlogn. In the other direction, we can prove that Maker can 
achieve a star of size }+const-,/n. If the complete graph K;, is replaced with 
the complete bipartite graph K,,,, then we obtain the following interesting 
“row-column game”. 


goal 8, 
200 log n 
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Theorem 16. Consider the game where the board is an n x n chessboard. 
Breaker and Maker alternately select one previously unselected cell. Breaker 
marks his cells blue and Maker marks his cells red. Maker’s object is to achieve 
at least } +k (k > 1) red cells in some line (= row or column). If k = va 
then Maker has a winning strategy. 


Note that the proof of Theorem 16 is based on a game theoretic second 
moment method. This explains the “standard random fluctuation” of size ./n. 

Theorem 16 is in sharp contrast with the chessboard type alternating 
two-coloring, where the discrepancy in every line is 0 or 1 depending on the 
parity of n. 

Note that a straightforward modification of the proof of Theorem 16 gives 
the above-mentioned case where the board is Ky. 


Proof of Theorem 15. Given a simple and undirected graph G, and an 
arbitrary subset S of the vertex-set V(G) of G, denote by T¢(S$) the set 
of vertices in G adjacent to at least one vertex of S. Let |.S| denote the 
number of elements of a set S. 

The following lemma is essentially due to Pésa [26] (a weaker version was 
earlier proved by Komlés-Szemerédi [22]). A trivial corollary of the lemma is 
that an expander graph has a long path. 


Lemma 1. Let G be a non-empty graph, v9 € V(G) and consider a path 
P = (v0, U1,---;Um) of maximum length which starts from vo. If (vi,;Um) € 
G(1 <i < m-—1) then we say that the path (v0,..-, Vi; Um; Um—15 +++; Vi41) 
arises by Pésa-deformation from P. Let end(G, P,vp) denote the set of all 
endpoints of paths arising by repeated Pésa-deformation from P (the starting 
point vp is always fixed). Assume that for each vertex-set S C V(G) with 
[S| < k, |Te(S) \ S| = 2|S|. Then |end(P, G,vo)| > k +1. 


In order to use Lemma 1, we need 


Lemma 2. Under the hypothesis of Theorem 15, Maker can guarantee, that 
right after Breaker occupied Hi 1) edges, Maker’s graph satisfies the following 
property. Property A: For any vertex-set S of Ky with Le- x02 n < |S| < n/4, 
ICa(S) \ S| > 2|S|+ e~20n where G is Maker’s graph right after Breaker 


claimed 35 (5) edges. 


Proof. We employ a general theorem concerning hypergraph games. Let H be 
a hypergraph with vertex-set V(#) and hyperedge-set E(H), and let p > 1 
and q > 1 be integers. A(H;p, 1; ¢)-game is a game on H in which two players, 
I and II, select p and 1 previously unselected vertices per move from V(#). 
The game proceeds until alV(H)| vertices has been selected by I. II wins if 
he occupies at least one vertex from every hyperedge A € E(H); otherwise 
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I wins. In [7] we proved the following generalization of the Erdés-Selfridge 
criterion: If 


AE€E(H) 


then II has a winning strategy in the (H; p, 1; q)-game. 
In order to employ (13), we introduce some hypergraphs. Let m be an 
integer satisfying }e~20n < m < n/4, and let H(n;m) be the set of all 


complete mx (n—3m—e— m0E n+ 1) bipartite subgraphs of K,,. The “vertices” 
of H(n;m) are the edges of K,,. 

Now to ensure property A, in view of (15) with p = b = en and q = 20, 
it is enough to check the following inequality: 


n/4 
S n n- o g—m(n—3m—e~ THE n+1)/20en < i 
: m)] \2m + e7 %0en — 1 2 


(16) 


A standard calculation shows that (16) holds, and Lemma 2 follows from (15) 
and (16). 


Now we are ready to complete the proof of Theorem 15. We show that if 
Maker uses the strategy in Lemma 2, then H =“Maker’s graph at the end” 


contains a cycle of (1 -—e7 nite) n edges. 


n 


Let G = “Maker’s graph right after Breaker occupied 30 (5) edges”. 


Assume that there exists a vertex-set 5; C V(K,,) with |Si| < Lem n 
such that |[¢(S1) \ Si] < 2|S1|. Throwing away the vertices 'g(S1) U Si 
from G,, we get a new graph G,. Again assume that there exists a vertex-set 
So C V(G1) with |S2| < Le- zen such that |g, (S2) \S2| < 2|S2|. Throwing 
away the vertices 'g, ($2) U S2 from G1, we get a new graph Gg, and so on. 
This truncation procedure terminates (say) in t steps: G; = Gyi1 =-:-. That 


is, for any vertex-set S C V(G;) with || < Lean, 


IT'e,(5) \ 5] 2 215]. (17) 
We claim 
IV (Gt)| > (1 - ei ) n. (18) 


Indeed, otherwise there is an index i(< t) such that at the ith stage of the 
; 1 
truncation, the union set S = 9, U---US; first satisfies 4e~ 2006 n < |S], so 


3 
1 ne 
Ze 200 N <|S|< 2e~ 0 Nn <n/4 and 
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IPa(S) \ S| < 2|S], 


which contradicts property A in Lemma 2. 
It follows from (17), (18) and property A that for every vertex-set S' C 
V(G:) with |S] < n/4, 


ITe,(S) \ S|] 2 2|5]. (19) 


It immediately follows from (19) that G; is a connected graph. We are 
going to show that Maker can build up a Hamiltonian cycle on the vertex-set 

Let P be a path in G; of maximum length. Inequality (19) ensures that 
the truncated graph G; satisfies the condition of Pésa’s lemma with k = n/4, 
so (see Lemma 1) | end(G;, P, vo)| > n/4 where vp is one of the endpoints of P. 

Let end(G;, P,vo) = {x1,22,...,c%} (k > n/4), and denote by P(z;), 
1<i<ka path arising from P by a sequence of Pésa-deformations (see 
Lemma 1), vo is fixed and having other endpoint x;. By Lemma 1, for every 
x; € end(G:, P, vu), 


|end(G:, P(a;),x;)| > n/4. (20) 
Let 
close(G,, P) = {(a;, y) : vj € end(G:, P, vo), y € end(G;, P(x;), x;)}. 


By (20) we have |close(G;,P)| > (n/4)?/2 = n?/32. Since at this 
moment Breaker’s graph contains a edges, there must exist a previously 
unselected edge e; in close(G;,P). Let e; be Maker’s next move. Then 
Maker’s graph Gy) = G;, U {e1} contains a cycle of length |P|. Moreover, 
Gg?) = G; U {e1} is connected, thus either |P| = |V(G;)|, and we have a 
Hamiltonian cycle in the truncated vertex-set, or Gy contains a longer path 
(i.e. a path of length > |P| + 1). 

Let P, be a path of maximum length in am, Repeating the argument 
above, we obtain that | close(G‘”, P,)| > n?/32. Since at this moment 

1 


Breaker’s graph contains 30 (3) +en < n?/32 edges, there must exist a 


previously unselected edge e2 in close(G? , P,). Let e2 be Maker’s next move. 
Then Maker’s graph Gg?) = G, U {e1,e2} contains a cycle of length | P|. 
Moreover, a?) = G; U {e1, e2} is connected, thus either |P,| = |V(G:)|, and 
we have a Hamiltonian cycle in the truncated vertex-set, or G;’ contains a 
longer path (i.e. a path of length > |P,|+1). By repeated application of this 
procedure, in less than n moves (so the required inequality 0 (5) tn-ren< 
n? /32 holds), Maker’s graph will certainly contain a Hamiltonian cycle in the 
truncated vertex-set V(G,). Theorem 15 follows. 
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Proof of Theorem 16 (a “fake second moment” method). Consider a play ac- 
cording to the rules. Let x1, 22,...,2; be the blue cells in the chessboard 
selected by Breaker in his first 7 moves, and let y1,y2,...,y;—-1 be the red 
cells selected by Maker in his first (¢ — 1) moves, and the question is how to 
find Maker’s optimal ith move y;. Write 

X; = {x1, 22, sae xi} and Yj-1 = {yi, Y2,--- ,Yi-1}- 


Let A be a line (= row or column) of the n x n chessboard, and introduce the 
the “weight” : 


+ 
wid) = {lan Kal -l4nx+ Eh 


where 


0, otherwise. 


Or tee 


Let y be an arbitrary unselected cell, and write 
wily) = wi(A) + wi(B) 


where A and B are the row and the column containing y. 
Here is Maker’s winning strategy: at his ith move he selects that 
previously unselected cell y for which the maximum of the “weights” 


y ee a a (y) 


is attained. 
The following total sum is a sort of “variance”: 


T= S> (wi(A))?. 
2n lines A 


The idea of the proof is to study the behaviour of T; as i = 1,2,3,... and to 
show that Teng is “large”. 


Remark 1. The more natural “symmetric” total sum 
do (%-19 Al - |X Al)” 
2n lines A 


is useless because it can be large if in some line Breaker overwhelmingly 
dominates. This is exactly the reason why we had to introduce the “shifted 
and truncated weight” w;(A). 
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First we compare T; and T;,1, that is, we study the effects of the cells y; 
and 2; ,. We distinguish two cases. 


Case 1: the cells y; and x;41 determine four different lines. 
Case 2: the cells y; and x;,; determine three different lines. 


In Case 1, an easy analysis shows that 
Tug > TA (21) 
except for the “unlikely situation” when w;(y;) = 0. Indeed, 
wi(yi) = wi(A) + wi(B) = wi(titi1) = wi(C) + wi(D), 
and so 
Tia = T; + 2wi(yi) — 2wi(vi41) + {2 or 1 or 0} > T; + {2 or 1 or O} 
where 


2, if w;(A) > 0,w;(B) > 0; 
{2 or 1 or 0} = § 1, if max{w;(A), w;(B)} > 0, min{w;(A), w;(B)} = 0; 


Even if the “unlikely situation” occurs, we have at least equality: Tj41 = Tj. 
Because y; was a cell of maximum weight, for x;41, and for every other 
unselected cell x, w;(x) = 0. 

Similarly, in Case 2, 


Ty41 > T;, +1 (22) 


except for the following “unlikely situation”: w;(B) = 0 where A is the line 
containing both y; and 2,41, and B is the other line containing y;. Even if 
this “unlikely situation” occurs, we have at least equality: T;11 = T;. Because 
y; was a cell of maximum weight, it follows that w;(C) = 0 where C is the 
other line containing 7;,,, and similarly, for every other unselected cell x in 
line A, w;(D,) = 0 where D, is the other line containing «. 

If ¢ is an index for which the “unlikely situation” in Case 1 occurs, 
let unsel(i) denote the set of all unselected cells after Breaker’s ith move. 
Similarly, if 2 is an index for which the “unlikely situation” in Case 2 occurs, 
let unsel(7, A) denote the set of all unselected cells after Breaker’s (4 + 1)st 
move in line A containing both y; and x44, including y; and 7;41. 

If the “unlikely situation” occurs in less than 3n?/10 moves (i.e. in less 
than 60% of the total time), then we are trivially done. Indeed, then by (21) 
and (22), 

2 


Tend — Th2/2 2 = 
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Since T.,q is a sum of 2n terms, we have 


2 nm 
gnmax , (wna/2(A))” 2 > = Gp: 


That is, for some line A, 


—) + 
nm 
una/alA) = {IAN ¥saja-al = [AN Xnaal + EL" > Ynfio 


where 
a, if a> 0; 
{a}? = 
0, otherwise. 
So 
Aneel Aneel s/ao- vn > = 


and Theorem 16 follows. 

If the “unlikely situation” in Case 1 occurs in more than n?/10 moves 
(i.e. in more than 20% of the time), then let io be the first time when this 
happens. Clearly 


| unsel(ig)| > 2n?/10 = n?/5. 


It follows that there are at least (n?/5)/n = n/5 distinct columns D 
containing (at least one) element of unsel(ig) each. So w;(D) = 0 for at 
least n/5 columns D, that is, 

IDA X;| —|DAY;-1| > ue 


for at least n/5 columns D. Therefore, after Breaker’s ioth move, 


S> {IPN X|-|DA¥ajt > 2. (23) 
n columns D 5 4 
Since 
1+ SO {jDN¥al-|DNXIt= So {\DnXiJ-|DNY-al}, 
n columns D n columns D 
by (23), 


3/2 
YY (PA¥al)-|DXI} > 


n columns D 
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Since the number of terms on the left-side is less than n — n/5 = 4n/5, after 
Breaker’s zgth move we have, 


n3/2/20 Jn 
4n/5 16° 


max{|D NY¥;-1| -—|DN Xi} > 


Obviously Maker can keep this advantage of ,/n/16 for the rest of the game, 
and again Theorem 16 follows. 

Finally, we study the case when the “unlikely situation” of Case 2 occurs 
for at least n?/5 moves (i.e., for at least 40% of the time). Without loss of 
generality, we can assume that there are at least n?/10 “unlikely” indices 7 
when the line A containing both y; and x41 is a row. We claim that there is 
an “unlikely” index 79 when 


|unsel(t9, A)| > n/5. (24) 


Indeed, by choosing y; and 2;41, in each “unlikely” move the set unsel(i, A) 
is decreasing by 2, and because we have n rows, the number of “unlikely” 
indices 7 when unsel(i, A) < n/5 is altogether less than n- nls =n (10, 
Now we can finish just like before. We recall that w;,(D) = 0 for those 
columns D which contain some cell from unsel(io,A) (here A is the row 
containing both y;, and 2,41). So by (24), wi,(D) = 0 for at least n/5 
columns D, that is, 
DAx|=(DAYa|= a 


for at least n/5 columns D. Therefore, after Breaker’s ioth move, 


S- {Dn Xi] -|DAYal}t > 2. (25) 
n columns D 
Since 
14 SD (IDa¥ial-lpnxXit= SD {pn X|-|DAYi-al, 
n columns D n columns D 
by (25), 
3/2 
S- {IDNY¥.-1|—|DN Xi)}* = = 


n columns D 


Since the number of terms on the left-side is less than n — n/5 = 4n/5, after 
Breaker’s zgth move we have, 


n3/2/20 _ Jn 
4n/5 16° 


max{|D al Yj-1| _ |D (A X;|} > 
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Obviously Maker can keep this advantage of ,/n/16 for the rest of the game, 
and again Theorem 16 follows. The proof is complete. 


5. Algorithms and Complexity 


Perhaps the most interesting applications of game-theoretic ideas are in 
theoretical computer science. The following is an illustration of this. 

Let N denote, as usual, the set of natural numbers, and let K, = [N]? 
be the set of pairs, that is, an explicit representation of the infinite complete 
graph on the set of natural numbers. 

Suppose we are given a 2-coloration (red and blue) of the edges of 
K. = [N]?. The problem is to find a monochromatic complete subgraph of 
k vertices, that is, a monochromatic copy of Ky. The well-known Ramsey’s 
theorem implies the existence of such a subgraph. At each step we may ask 
the color of an arbitrary edge and we are interested in the minimum number 
of questions necessary. The standard proof of Ramsey’s theorem, the so-called 
ramification method, guarantees that we can find a monochromatic complete 
subgraph of size k within 4” steps (=questions). Here is an outline of the 
procedure. In the first 2?*-? — 1 steps we test a star, e.g. we ask for the colors 
of the edges {1,2}, {1,3},..., {1,27*-?}. Let {1,r}, r€ Rand {1,b},be B 
be the sets of red and blue edges, respectively. The sum of the cardinalities of 
R and B is precisely 2?*~? — 1, therefore one of them, say R has cardinality 
at least 2?*-3. Choosing an arbitrary element s2 € R we ask the colors of the 
274-3 _1 edges {so,r},r € R(r # 82) and so on. Finally we obtain a sequence 
81 = 1,52,...,S2%—1 of vertices such that the color of the edge {s;,s;},i <j 
depends on index i only. Let c(i) € {red, blue} be this color. Then the 
complete subgraphs induced by the vertex-sets {s; : c(t) red} U {s2,-1} and 
{s; : c(j) blue} U {s2x-1} are monochromatic. Since one of them has size at 
least k, we are done. This procedure needs at most Yo —1) <4" steps 
(= questions). 

Many years ago we proved that no method can perform much faster than 
this ramification procedure (see [6]). 


Theorem 17. An algorithm which, for any 2-coloration of K.., will deter- 
mine a monochromatic complete subgraph with k(> 3) vertices requires more 
than 2*/? steps. Here a step means asking the color of a single edge. 


Theorem 17 was suggested by a well-known result of Erdés [17] stating 
that there exists a 2-coloration of the complete graph of const -k2*/? vertices 
which does not contain a monochromatic Ky. The proof of Theorem 4 
provides an efficient algorithmic proof of Erdés’ theorem which was originally 
proved by a probabilistic argument. 

Consider the following game-theoretic equivalent of the problem above. 
Two players, I and II, are playing on the “board” K,, = [N]?. On each move, 
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Player I selects a previously unselected edge of K = [N]? and Player II 
colors it by red or blue. Player I wins if there exists a monochromatic copy 
of Ky in the graph selected by him during the play; otherwise Player II wins. 
That is, the aim of Player II is simply to prevent Player I from achieving his 
goal. Observe that Theorem 17 is equivalent to the statement: 

Player II has a strategy such that I is unable to win within 2/? moves. 

Recently Noga Alon found a very simple proof of Theorem 17. We briefly 
describe his elegant argument. We want to show that Player II can force 
r2(k)/2 questions (=moves), where r2(k) is the maximum cardinality of a 
complete 2-colored graph with no monochromatic K,. This gives, by Erdés’ 
bound r2(k) > const-k2*/?, the lower bound in Theorem 17 (in fact, a sightly 
better bound with an extra factor of k). 

Put n = re(k) and let Player II fix a 2-coloration of the edges of K, = 
{1,2,...,n}? with no monochromatic K;,. We are going to show that Player 
II can make sure that after n/2 moves the colored part of Ko = N? will be 
isomorphic to a subgraph of his own fixed 2-colored K,, = {1,2,...,n}? with 
no monochromatic Ky. So Player I cannot win within n/2 moves. 

The strategy of Player IT goes as follows: let 71, 72, 73,...,%; be the vertices 
that appear in the questions of Player I (clearly | < 2(n/2) = n, if at most 
n/2 questions have been asked), and assume this is an enumeration of these 
vertices according to the order they appear in the questions of Player I. Define 
f(i;) = j (observe that Player II knows f(2;) as soon as vertex number 
i; is asked, he does not have to wait to the next questions of Player I). 
Now, when Player I asks for the color of {i;,i;} Player II answers by telling 
the color of the edge {j,s} in his own fixed 2-colored K, = {1,2,...,n}? 
with no monochromatic K;,. This will guarantee what is needed: up to n/2 
moves the colored graph we have will be isomorphic by f to a graph with no 
monochromatic Kx. 

Alon’s argument proves that any decision tree algorithm needs r2(k)/2 
many queries in worst case. By employing the Erdds-Selfridge “derandom- 
ization” of Erdés’ probabilistic proof for r2(k) > const - k2", we get a simple 
weight-function algorithm (= strategy), and so we can essentially reduce the 
complexity of the adversary’s strategy. The “danger” of this is that the Erdés- 
Selfridge lower bound might turn out to be much smaller than r2(k). 

Note that our original proof for Theorem 17 was a weight-function 
strategy, too, but it was more complicated. 

Next we study the parallel matching complexity of the problem above, 
that is, we are interested in the minimum number of steps necessary to find 
a monochromatic K,,, where a step means asking the colors of many point- 
disjoint edges at once. First we formulate a 


Conjecture 3. A parallel matching algorithm which, for any 2-coloration of 
the edges of Koo, will determine a monochromatic Ky, requires more than c” 
steps (i.e., exponential time), where c > 1 is an absolute constant and a step 
means asking the colors in an arbitrary matching at once. 
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This conjecture was suggested by the following easy corollary of the 
Lovasz Local Lemma (see [19]). 


Theorem 18. If G is an arbitrary graph (finite or infinite) with maximum 
degree at most 2"/?, then the edges of G can be 2-colored such that there is 
no monochromatic Ky. 


The remarkable feature of the Local Lemma is that it guarantees the 
existence of a “needle in a haystack”. Indeed, if the number |G] of edges of G 
is a super-exponential function of n, then the probability of “success” for the 
standard random 2-coloring is an exponentially small function of |G]. So the 
Local Lemma does not supply even an efficient randomized algorithm. 

Recently we managed to convert some of the applications of the Lo- 
cal Lemma into polynomial time sequential algorithms at the cost of a 
weaker constant factor in the exponent (see [8]). The algorithmic version of 
Theorem 18 goes as follows (note that its proof heavily uses game-theoretic 
ideas, especially “derandomization” and the trick of “BIG GAMES” and 
“small games” ). 


Theorem 19. Let G be a finite graph with maximum degree at most 
2"/96 | Then there is a deterministic sequential algorithm with running time 
|G|\corst which produces a 2-coloring of the edges of G such that there is no 
monochromatic Ky. 


A natural approach to attack the Conjecture above is to try to convert 
the algorithmic proof of Theorem 19 to a game-theoretic strategy. However, 
some surprising (or natural) technical obstacles prevented us from realizing 
this heuristic. 

On the other hand, we could prove the following interesting partial result: 
the Conjecture above holds if a step means asking the colors in a matching 
of size < 2”"/10 at once (see [9]). 


Theorem 20. A parallel matching algorithm which, for any 2-coloration of 
the edges of Ko, will determine a monochromatic Ky, requires more than 
const - 2"/49 steps (i.e., exponential time), where a step means asking the 
colors in an arbitrary matching of size at most 2r*/10 at once. 


We emphasize that gn’/10 ig super-exponential, that is, gn’/10 ig asymp- 
totically much bigger than 4” (which is the number of edges necessary to get 
a monochromatic K,, via the “ramification” method described above). This 
justifies our intuition that independent edges do not help too much to build 
a monochromatic K,, up. 

It is worth to note that, in contrast to the Conjecture (and to Theorem 20), 
where the parallel matching complexity is exponential, the parallel star 
complexity of the Ramsey theorem is linear. Indeed, the ramification method 
requires the testing of at most 2n—1 stars (each having at most exponentially 
many edges). 
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Consider the following game-theoretic equivalent of Theorem 20. Two 
players, I and II, are playing on the “board” K,, = [N]?. On each move, Player 
I selects m = 2”’/!° previously unselected point-disjoint edges of Ko. = [N}? 
(i.e., an m-matching) and Player II colors it by red and blue. Player I wins if 
there exists a monochromatic copy of K,, in the graph selected by him during 
the play; otherwise Player II wins. That is, the aim of Player II is simply to 
prevent Player I from achieving his goal. Theorem 20 is equivalent to the 
statement: Player II has a strategy such that Player I is unable to win this 
m-matching game within const - 2"/4° moves. 
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1. Introduction 


This article summarizes progress on several old hypergraph problems of Paul 
Erdés and a few questions to which they led. Quite unexpectedly, there turned 
out to be substantial connections between the problems under discussion, 
surely some indication (if any were needed) that Erdés’ questions were the 
“right” ones. Here’s a quick synopsis. 

The story basically begins about 10 years ago, with Vojta Rodl’s beautiful 
proof [80] of the “Erdés-Hanani” Conjecture (Sect. 4). His proof was based 
on a powerful “semirandom” or “guided-random” approach. (I wish there 
were a better name for this.) A similar method had earlier been used in a 
less precise context by Ajtai, Komlés and Szemerédi [1] and Komlés, Pintz 
and Szemerédi [70]. Substantial extensions of Rédl’s work were subsequently 
achieved by Frankl and Rédl [38], Pippenger (see [87] or [42]), and Pippenger 
and Spencer [77] (see Sects. 4 and 6). 

Most of the work described in this paper had its beginnings in attempts 
to apply these ideas to prove a nonlinear lower bound on the function n(r) 
of Erdés and Lovasz discussed in Sect. 3. In the event, n(r) turned out to be 
linear, though discovering this would certainly not have been possible if the 
results of those initial attempts (see Sect.5) had not suggested where—or at 
least where not—to look for examples. 

In the meantime, an understanding of the above-mentioned results, 
particularly [77], had led to a proof of the “asymptotic correctness” of 
the well-known Erdés-Faber-Lovasz Conjecture (Sect. 2), which proof led 
eventually to a much stronger result (Theorem 12) on the asymptotic 
behavior of the list-chromatic index for hypergraphs; and further efforts to 
prove n(r)/r — oo had suggested the conjecture which eventually became the 
main result of Sect.5. (Theorem 9), and led in its turn to the investigations 
mentioned in Sects. 7 and 8. 

In this paper we mainly try to give an overview of these developments and 
connections, with discussion of proofs limited to hints at most. More detailed 
accounts of some of the material—especially more serious discussions of the 
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semirandom method—may be found in [42] and [60]. (I should also say that 
various bits and pieces of this article are borrowed from [60-62].) 


Terminology 


Throughout we use H to denote a hypergraph on vertex set V For further 
hypergraph background see, e.g., [42] or [9]. 

The degree (in H) of x € V is the number of edges of H containing 2, 
and is denoted dz,() or simply d(a). Similarly, d(x, y) denotes the number 
of edges containing both of the vertices x,y and d(X) the number of edges 
containing all vertices of X C V. We write D(H) for the largest degree in H. 
A hypergraph is D-regular if each of its vertices has degree D. 

A hypergraph is intersecting, resp. simple (or nearly-disjoint, but we won’t 
use this), if any two of its edges have at least, resp. at most, one vertex in 
common. 

For X,Y € HUV, the distance from X to Y, denoted A(X,Y), is the 
least m for which there exists a sequence X = Xo,...,Xm = Y from HUV 
such that for each 7, X;_; is an element of X; or vice versa. 

A matching of H is a collection of pairwise disjoint edges, and the size 
of a largest such collection, denoted v(H), is the matching number of H. We 
write M(H) for the set of matchings of H. 

A vertex cover (clearer would be “cover of edges by vertices”) of H is a 
set of vertices meeting every edge of H, while an edge cover is a collection 
of edges whose union is V. Either of these may be shortened to “cover” if 
there seems no danger of confusion. The vertex and edge cover numbers of H 
are the minimum sizes of its vertex and edge covers, and are denoted 7(H) 
and p(H). 

Each of v, 7, p has a fractional counterpart, obtained by regarding the 
object in question as the solution of an integer program and taking the linear 
relaxation thereof. Thus a fractional (edge) cover—the only one of the three 
needed here—is a function t: H > R* satisfying 


S tA) >1 VreV, (1) 


Ada 


and the fractional (edge) cover number is 
p’(H) = min{) > t(A) : t a fractional edge cover of H}. 


We also say that t: H — R* is a fractional tiling if equality holds in (1). 

The chromatic index (or edge coloring number) of H, denoted x'(H), is 
the least ¢ for which there is a “coloring” o : H — [t] which is proper in 
the usual sense that o(A) #4 o(B) whenever A, B are distinct, nondisjoint 
edges. Equivalently, x’/(#) is the least size of a collection of matchings whose 
union is H. We also write ¢(#) for the greatest size of a collection of pairwise 
disjoint covers contained in H. 
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These too have fractional versions, of which we only need the fractional 
chromatic index of H, denoted (unfortunately) y’*(H), and defined as the 


minimum value of 
Ss) f(M) 
MEM 


over f: M > R¢ satisfying 


SS) f(M)>1 VAEH. 
AEMEM 


Finally we need to say a little about asymptotic notation. For nonnegative 
f,g we use f ~ gand f <g for “f/g > 1” and “limsup f/g < 1”, with limits 
taken as some relevant parameter tends to infinity. We also write f =- g for 
(l+e)"! < f/g<1+e. As usual we use f = O(g), f = 0(g) and f = w(g) 
for (respectively) sup(f/g) < oo, f/g > 0 and f/g > ~w. 

We adopt the “uniformity convention” of [77], viz: any limiting statement 
involving one or more free variables ranging over vertices, edges or hyper- 
graphs is understood to hold uniformly with respect to all possible choices of 
these variables, as some specified numerical parameter tends to infinity. (See 
Theorem 7 for a first instance of this.) 


2. The Erd6és-Faber-Lovasz Conjecture 


To avoid trivialities, hypergraphs in this section are assumed to have no 
singleton edges. 

The celebrated Erdés-Faber-Lovasz Conjecture may be stated as follows 
(see [53]): 


Conjecture 1. Any simple hypergraph H on n vertices has chromatic index 
at most n. 


Erdés has for many years listed this as one of his “three favorite 
combinatorial problems” (the other two being the A-system problem of Erdés 
and Rado, and the problem of Erdés and Lovasz described in Sect. 3), and 
currently offers $500 for its resolution (see, e.g., [29]). 

Notice first of all that the Conjecture is sharp in the case H is either 


(a) A projective plane or degenerate projective plane (the latter being the hy- 
pergraph with vertex set {0,1,...,n—1} and edge set {{0,1},...,{0,n— 
1}, 1 sey Th Ly); or 


(b) A complete graph on n vertices, n odd. 


(Sufficiently minor modifications of (b) also give equality.) 
On the other hand: 
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(c) For intersecting H, Conjecture 1 is just the de Bruijn-Erdés Theorem [20], 
which says that if |AM B| = 1 for all distinct A,B € H, then |H| < n 
(with equality only for H as in (a)). 

(d) For graphs, Conjecture 1 is contained in Vizing’s Theorem [90] stating 
that the chromatic index of a simple graph of maximum degree D is at 
most D+ 1. (Of course this special case—Vizing’s Theorem for complete 
graphs—is easily proved directly. On the other hand, as observed, e.g., 
by Meyniel (unpublished), Berge [10] and Fiiredi [41], it seems likely that 
the bound in Conjecture 1 can be replaced by max,ey | Ussz Al, which 
for graphs reduces to Vizing’s Theorem in full.) 


Graphs and intersecting hypergraphs are in some sense the extreme cases 
of Conjecture 1. One of the problem’s most appealing aspects is that it 
has proved so intractable despite being manageable at these extremes, and 
apparently less accurate between them. 


Bounds 


The history of results on Conjecture 1 is rather brief, surely more an 
indication of the difficulty of the problem than of any lack of attempts to 
resolve it. The first significant progress was made by P. Seymour, who showed 


Theorem 1 ((84]). If H is simple on n vertices, then v(H) > |H|/n, with 
equality only in the cases (a) and (b). 


Note this is immediate from Conjecture 1. An intermediate statement was 
conjectured in [84] and proved in [68]: 


Theorem 2. If H is simple on n vertices, then y'* <n. 


Equivalently (by LP-duality), 


Vf:H—> RIM € M(H) such that 
S“{f(A): A€ M} Sn" S“{f(A): AEH}. (2) 


(So taking f = 1 we recover Theorem 1.) The proof of Theorem 2 turned out 
to be much simpler than that of Theorem 1 because it was possible to exploit 
properties of a worst f in (2). 

It seems to have been noticed by several people that a greedy coloring of 
edges of H in any nonincreasing size order requires at most 2n — 3 colors. In 
the absence of edges of size 2 this bound shrinks to about 3n/2, and Chang 
and Lawler [21] showed how to modify the greedy procedure to achieve the 
same bound (precisely, y’/(H) < [1.5n — 2]) in general. That Conjecture 1 is 
at least asymptotically correct was subsequently proved in [58]: 
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Theorem 3. If H is simple on n vertices, then x'(H) <n+o(n). 


The proof of this is based on the “semirandom” method discussed below 
(see especially Sect. 6), and actually gives y’/(H) < (1+o0(1)) maxzey |Uss2A| 
(c.f. (d) above). 


Digression: Borsuk and Larman 


There’s at least a formal similarity between Conjecture 1 and the following 
problem of Larman [73]. (A hypergraph is t-intersecting if any two of its 
edges share at least t vertices.) 


Conjecture 2. If H is a t-intersecting hypergraph on n vertices, then H = 
H,U-:-UH, with each H; (t + 1)-intersecting. 


This is motivated by, and for uniform 1 is a special case of “Borsuk’s 
Conjecture” that every bounded set in R@ is the union of d+1 sets of smaller 
diameter ([19]; see [17, 24, 46] for further discussion). 

Conjecture 2 and Borsuk’s Conjecture were recently disproved in [65]. (We 
again come back to Erdés. The disproof is a simple application of a Theorem 
of Frankl and Wilson [39] which has its roots in the de Bruijn-Erdés Theorem 
and Fisher’s inequality [36]. Erdés was also one of the first to suggest that 
Borsuk’s Conjecture might be false [30].) 

The case t = 1 of Conjecture 2 remains open (and interesting). Here 
Fiiredi and Seymour (see [31, 68]) proposed the stronger conjecture that one 
may use #,’s of the form {A € H: AD {x,y}} for appropriate vertex pairs 
{x,y}. This too turns out to be false [64], though a simple disproof would 
still be welcome. (Curiously, the random construction of [64] takes just a few 
lines to describe, but as of now about 20 pages to justify.) 


3. A Problem of Erd6és and Lovasz 


In a seminal paper [33], Erdés and Lovasz pose the problem of estimating, 
for positive integer r, 


n(r) := min{|H| : H r-uniform, intersecting, with 7(H) =r}. 


That is, with how few intersecting r-edges can one force T = r? While the 
conditions here may at first glance seem a little arbitrary, notice that we must 
require “intersecting,” or some substitute, to make the question nontrivial, 
and that once we assume “r-uniform, intersecting,” we are just asking that 7 
be as large as possible subject to these conditions. Thus the Erddés-Lovasz 
problem is a quite natural way of making concrete the vague question, how 
can one economically force large cover number in a hypergraph with large 
edges? 
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Erdés and Lovadsz showed (writing P, for any projective plane of order 
r—1) 


n(r) > 8r/3 — 8 for all r, (3) 
and 
n(r) < 4r°/? log r if there exists a P,, (4) 
the second inequality being an immediate consequence of 


Theorem 4 ([33]). Jf H is a set of m > 4r3/? logr random lines from Py, 
then Pr(t(H) =r) > I(r > ov). 


They also conjectured that the correct rate of growth here should be 
rlogr. This was shown in [59]: 


Theorem 5. Jf H is a set of m > 22rlogr random lines from P,, then 
Pr(r(H) =r) > 1(r > oo). 


Of course this also gives the corresponding improvement in (4). The 
correct value of m here is probably about 3r log r; see [59] or [60] for a precise 
statement. 

The problem from Erdés’ “list of three” was to decide whether 


n(r) = O(r). (5) 


This was done in [61]. The answer—that (5) is true—was probably not what 
most people expected. (Certainly it wasn’t what the author expected.) 

We don’t have space to go into the construction here, but want to mention 
that one ingredient is the work of Chowla, Erdés and Straus [23] on the 
existence of large sets of mutually orthogonal Latin squares. See also the 
discussion following Theorem 9 for a small additional hint at what’s involved. 

The constant in (5) is so far not very good. Quite surprisingly the best 
lower bound is still (3), though I feel quite certain this could be improved 
somewhat via the ideas of [57, 66] discussed in Sect. 5. 


Meyer’s Problem 


In connection with n(r), let us just briefly mention a related problem of 
similar vintage due to J.-C. Meyer [76]. Meyer defined 


m(r) = min{|H| : H a maximal intersecting, r — uniform hypergraph} 


and conjectured that m(r) > r? — r +1 (projective planes being the obvious 
examples). This was disproved by Fiiredi [40], who showed 


m(r) < 3r?/4 if there exists a P,/241.- (6) 


On the other hand, despite a fair amount of subsequent effort, it remains quite 
unclear how m(r) ought to grow. As of now the best results are (from [11, 18] 
and [25] respectively) 
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m(r) < r° Vr, (7) 
m(r) < r*/2+ O(r) if there exists a P,, (8) 

and 
m(r) > 8r for r > 4. (9) 


(The lower bound is a slight improvement on m(r) > 8r/3 —3, which follows 
from (3), since trivially m(r) > n(r). See also [60] for a proposed construction 
for m(r) = o(r”) when there exists a P,.) 

While the examples for n(r) described above don’t seem to give anything 
for m(r), they seem to me strongly to suggest the truth of 


Conjecture 3. m(r) = O(r). 


4. The Erdos-Hanani Conjecture and Asymptotics 
of Packing and Covering Problems 


Both Theorem 3 and, in a sense, the proof of (5) had their roots in yet 
another Erdés problem, the so-called “Erdés-Hanani Conjecture” of 1963, 
and in Rodl’s beautiful and seminal proof thereof. Here and in the next two 
sections we outline work which grew out of Rddl’s Theorem. As mentioned 
earlier, much of this material, and in particular the powerful “semirandom” 
approach underlying it all, was discussed at some length in [42, 60], so we 
will be pretty brief here, especially as regards the proofs. 


The Erdés-Hanani Conjecture 


For positive integer t, say a family F of subsets of a set V is a t-packing 

(resp. t-cover) if each t-subset of V is contained in at most (resp. at least) 

one member of F. For2<t<k<v=|V|, let P(v,k,t) (resp. C(v,k,t)) 

denote the size of a largest t-packing (resp. smallest t-cover) of k-sets in V. 
Erdés and Hanani [32] proved that the obvious bounds 


P(v,k,t) < @ (‘) < C(v,k,t) (10) 


t 


are asymptotically tight for t = 2 and any fixed k, and conjectured the same 
result for every t and k. This is Rédl’s Theorem: 


Theorem 6 ([80]). For every fixed t and k, 


P(v,k,t) ~ (‘) (‘) ~ O(v,k,t). 


350 Jeff Kahn 


(This is an asymptotic version of a well-known conjecture in the theory 
of block designs which states that the bounds (10) are exact for large enough 
v satisfying the obvious necessary conditions 


a _ 
(9) (71) weocigent 
t—4 t—4 


For ¢ = 2 this was proved by R. M. Wilson in the early 1970s [93], but for 
t > 3 a proof still appears remote.) 

In other language, Theorem 6 gives the asymptotics of the matching and 
edge cover numbers of the hypergraph H = {(“) : K € (¥)} on the vertex 
set (od In fact, as shown by P. Frankl and Rédl [38], Rédl’s Theorem is just 
one instance of a remarkably general packing and covering phenomenon for 
hypergraphs with bounded edge sizes. An even stronger and cleaner version 
of their Theorem was proved by N. Pippenger (unpublished; for the original 
proof see [87] or [42]): 


Theorem 7. Let k be fixed and H a k-uniform D-regular hypergraph on 
n vertices satisfying 


d(x,y) < o(D) for all distinct vertices x,y. (11) 
Then 
V(H) ~ n/k ~ p(H). 


(The Frankl-Rédl Theorem differs from Theorem 7 in requiring an explicit 
bound (roughly D/(log|V|)*) on pairwise degrees. Incidentally, Theorem 7 
is our first use of the “uniformity convention” (see Terminology): limits are 
taken as D > oo, with convergence uniform in x, y and H.) 


The “semirandom” Approach 


Joel Spencer remarks in [87] that the Erdés-Hanani Conjecture always 
seemed a natural candidate for a probabilistic proof. And the proof was 
probabilistic. .. 

A natural way to try to prove Theorem 7, say for matchings, would be as 
follows. Let Hp =H, Mp = @, and for i=1,... do 


(i) Choose A; uniformly at random from H;-1, 
(ii) Set M; = M;_1 U {A;} and H; = {A EH;-1: ANA; = pb}. 


When H; = 9 we stop and take M; to be our matching. 

Most likely this procedure does work (e.g., in the sense that the random 
matching it produces has expected size asymptotic to n/k), but I don’t think 
anyone knows how to show this at the moment. Rddl’s fundamental insight 
(translated to the proof of Theorem 7) was that we can do the analysis if at 
each stage we choose a small but fixed (positive) proportion of the desired 
matching M, rather than just one edge. 
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To say this a little more precisely, we switch from matchings to covers for 
a while. Thus we want to show p(H) < (1+ )n/k for 6 > 0 fixed, H as in 
Theorem 7, and sufficiently large D. 

Fix ¢ > 0 small relative to 6. Let Hyp = H, Vo = V, and iterate the 
following procedure for i running from 1 to about ¢~' log(1/6). Let K; be a 
random subset of H;_; chosen according to 


Pr(A E K;) = e/Di-1 
(for D; see below), these events mutually independent. Set 
Vi=Vin\ {A: AER}, He= {Ae Hii: AC Vi}. 


After the specified number of iterations we add to UX; one edge containing x 
for each « € V not covered by UK;, and claim this (usually) gives the desired 
cover. Of course what needs to be shown is that | U K;| is typically about 
n/k, while |V \ U{A : A € UK;}] is small relative to n. 

The key to the success of this approach is that we can understand 
how various relevant quantities—|K;|,|Vi|,|Hi|, and especially degrees in 
H;— ought to evolve, and, moreover, show that they do typically evolve 
approximately as they ought. In particular, each “residual” hypergraph H; 
will be close to regular, meaning most of its vertices will have degree close to 
some (predictable) Dj. 

To see why this might be true, suppose we fix 2 € V;_; and condition on 
{x € V;} (that is on {2 € AC Hi-1 > A ¢ K;}). Then writing X, for the 
indicator of {A € Hi}, du,(x) = 0{X4: a € A © Hi_1} is usually the sum 
of about D;—; Bernoulli random variables whose expectations are, because of 
the approximate regularity of H;; and (11), about (1 — ¢/D,_1)“-Y)?-. 
Moreover, again because of (11), there is considerable independence among 
these random variables, enough to enable us to say (via Chebyshev’s 
inequality) that dy, (x) is likely to be close to its expectation. 

Actual implementation of this rough description requires considerable 
care. In particular, it does take some thought to convince oneself that the 
various estimates don’t deteriorate unacceptably over the specified number 
of iterations; but we won’t go into this here. 

For the number of iterations, note that the “natural” value of Pr(x € 
Vilx € Vi-1) is about (1 —e/D;_1)?*-1 = e~*, so that e~1 log(1/6) iterations 
should reduce the number of vertices to about 6|V]. 


Remarks. 


1. A technical but important point is that, if n is large relative to D we 
cannot guarantee that all degrees in H; are close to D;. It was precisely 
in the handling of this point that Pippenger improved on [38]. 

2. For the matching portion of Theorem 7 we may dispense with the final 
augmentation of UX; and simply take our matching M to consist of all 
isolated edges of UX;. The number of edges which this removes from 
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K; should be about e|K;|, an acceptable loss. Actually the two parts 
of Theorem 7 are easily seen to be equivalent, but for the proof of 
Theorem 10 below one wants a procedure for generating a nicely behaved 
random matching; see Theorem 11. The procedure just described— 
essentially that of [77|—is an improved version of Pippenger’s original 
proof designed to accomplish this. 


5. Fractional Versus Integer 


As stated earlier, the starting point for most of the work discussed here was 
the realization that something like Theorem 7 could be used to try to prove 
n(r)/r — oo. In this section we give a little indication of this connection 
and sketch the work (other than [61]) that evolved most directly from this 
attempt. 


Connection with n(r) 
For t:# > R*, let t(H) = )-{t(A): A € H}, define t: 2” > Rt by 
(A) = ST{(B): B > A}, 


and set 
a;(t) = max{t(W) : W CV, |W| = +}. 
For example, if H is r-regular, then for the fractional tiling t = 1/r we have 
a(t) =r~' max{d(z, y)} and (11) (with r replacing D) is equivalent to 
Q2(t) > 0, (12) 
so that Theorem 7 is contained in 


Theorem 8 ([57]). Let k be fixed, H a k-bounded hypergraph, andt:H— 
R* a fractional cover. Then 
p(H) StH) (a2(t) 0). 


(A similar result holds for matchings, but we confine ourselves here to 
covers.) 

To see the connection with the Erdés-Lovasz problem, we dualize: n(r) 
is the least number of vertices in an r-regular hypergraph satisfying 


d(x, y) > 0 for all distinct vertices x, y (13) 


and having edge cover number r. Thus the following consequence of Theo- 
rem 8 is relevant. 


Corollary 1. Suppose H is r-regular with at most cr vertices, c fixed, 
d(x,y) >0 for allz,y €V and 
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max{d(xz,y): x,y € V,a Ay} =o(r). 
Then p(H) < (e/(e+ 1) + o0(1))r. 
Or undualized: 


Corollary 2. Suppose H is r-uniform, intersecting, of size at most cr, 
c fixed, and satisfies 


max{|AN B|: A,B eH,A# B} =o(r). (14) 
Then T(H) < (c/(e+ 1) + 0(1))r. 


After a preliminary step which eliminates large edges, the connection 
between Theorem 8 and Corollary 1 is provided by the observation that if 
H is r-regular with n < cr vertices and satisfies (13), then the function 
t:H— R® given by 


t(A) =|A|/(n+r-—-1) (15) 
is a fractional cover of total weight 
S- t(A) =nr/(ntr—1) xe nr/(n+r) <er/(e+1). (16) 


ACH 


Notice also that larger pairwise degrees—corresponding to larger intersec- 
tion sizes in the original formulation—will tend to give even smaller fractional 
cover number, suggesting that the best hope for proving (5) should indeed be 
via something akin to the projective plane based constructions of Theorems 4 
and 5. But the above results say that one cannot prove (5) with H’s in which 
all edge intersections are small. 

This seemed for quite a while to support the opinion that n(r)/r + oo. 
But the correct lesson, very roughly, was that one should allow a few 
strategically placed small sets X with large d(X). This, it turns out, can 
be done in such a way that the value of the fractional cover doesn’t shrink 
too much, but we lose Theorem 8 entirely. 

The proof of Theorem 8 is similar to that of Theorem 7. At each stage 
we have some fractional tiling t;_1 of the remaining hypergraph H;_-1 which 
guides the choice of K;: we take each A € H;_1 to be in K; with probability 
etj_1(A). 

It’s then necessary to update t;_1 in addition to V;_, and H;_1. A nice 
bonus of the more general framework is that, because we are not restricted 
to uniform hypergraphs, the difficulty mentioned in Remark 1 at the end 
of Sect.4 here essentially takes care of itself. Our random procedure will 
produce a hypergraph G C H;_; and approximate fractional tiling s. We can 
then (with high probability) replace G by some H; < G (meaning each edge 
of H; is contained in an edge of G) and s by a fractional tiling t; of H; such 
that t;(H;) ~ s(G). In particular, we begin each iteration with a fractional 
tiling, the fractional analogue of a regular H4_1. 
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Local Behavior 


The work in [66] again grew out of attempts to push the approach of 
Theorem 8 to prove n(r)/r — oo. The general idea was that it should be 
possible to relax (12) to a requirement that “locally’—that is, on small sets 
of vertices—one can find ordinary (integer) covers which mimic the fractional 
cover t. A way to make this precise is as follows. 

For t: 1 — Rt and any X CV, define t|x : 2* > Rt by 


t|x(A) = 0 {t(B): BN. X = A}. 
Write M P(X) for the matching polytope of X: 
MP(X) = conv{1,, : M a matching of 2*}. 


Denote by b(t) the largest b such that for any X C V with |X| < b we have 
tly € MP(X). In place of (12) we then require that b(t) > oo. (Note this is 
weaker than (12).) 

Suppose for example that V(H) is partitioned into triples, and that we 
allow t({x, y}) to be large when x,y are in the same triple and take each 
edge of H to meet each triple in either 0 or 2 vertices. Then b(t) = 2 and 
it’s more or less typical (e.g., take H regular and uniform) for p(H) to be 
about (4/3)p*(#), reflecting the fact that we have p(I’) = (4/3)p* (I) for the 
underlying graph [ of pairs for which f is allowed to be large. 

But—this was the starting point—the fractional covers (15) arising in 
connection with n(r) cannot look like this, and in fact b(t) does tend 
to be large in situations of interest for n(r). (To see what’s meant here, 
assume most pairwise degrees in H are 1—if they’re not then we gain 
substantially in (16)—and use the fact that ¢ in (15) is given by t(A) = 
(n+r—1)7' >> cy 1¢452} to show that for X not too large, t|x is (usually, 
approximately) in MP(X).) 

At any rate, it turns out that “b(t) + co” is the correct relaxation of 
(12), provided we at least insist that a3 be small: 


Theorem 9 ([66]). Let k be fixed, H a k-bounded hypergraph, andt:H— 
R* a fractional tiling. Then 


P(H) St(H) (a3(t) + 0, b(t) + oo). 
This implies for example that in Theorem 8 we could replace a2(t) by 
max{t({z,y}):2,y eX or z,yEY} 


where XUY isa partition of V. That is, allowing ¢ to be large on the edges of a 
bipartite graph doesn’t create obstructions to good cover behavior, reflecting 
the fact that fractional and integer edge cover numbers coincide for bipartite 
graphs. 
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(Though this has yet to be checked, Theorem 9 probably implies that 
Corollary 2 holds even if we relax (14) to the analogous condition on 3-wise 
intersections.) 

If we allow as(t) to be large, then the situation changes completely. 
For instance, the dual, H, of a random cubic graph (as in [8, 12, 13]) is 
a 3-uniform, 2-regular hypergraph which typically has no short cycles, yet 
has p substantially greater than |V|/3. Thus the fractional tiling t = 1/2 has 
large b(t), yet p(H.) is much larger than t(H). 

I think it’s fair to say that it’s this phenomenon that lies at the heart of 
the construction of [61]: there the “small sets X with large d(X)” mentioned 
following (16) comprise a hypergraph with properties akin to those described 
in the preceding paragraph. 

Theorem 9 was conjectured in [60]. The proof turned out to be both harder 
and much more interesting than originally anticipated, involving, centrally, 
some understanding of the behavior of so-called “normal” distributions on 
the set of matchings of a graph. This connection is sketched a little in Sect. 7. 
The questions raised in [66] also led, if somewhat tangentially, to the work 
on random matchings outlined in Sect. 8. 


6. Chromatic and List-Chromatic Indices 


It was suggested by Fiiredi [88] that the hypotheses of Theorem 7 might 
guarantee the existence not just of one good matching or cover, but of a 
decomposition of the entire hypergraph into matchings or covers which are 
good on average. This was proved by Pippenger and Spencer. 


Theorem 10 ([77]). Under the hypotheses of Theorem 7, 
x'(H) ~ D(H) ~ o(H). 


(As noted in [77], the second assertion of Theorem 10 follows from the 
first; here we restrict our attention to y’.) 

Theorem 10 is based on an elegant variant of Theorem 7, which we state 
for future reference. 


Theorem 11. For every e >0 andk there are 6 > 0 andt so that if H is a 
k-uniform, D-regular hypergraph on V with 

d(z,y) < 6D Va,y € V, 
then there is a probability distribution p on the set M of matchings of H 
satisfying 


(a) {p(M): Ae Me M}=.1/D VAEH, 
(b) For M chosen according to p, and A © H, the event {A € M} is 
independent of the events {{B € M}: A(A, B) >t}. 
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In fact what’s shown in [77] is that the distribution obtained from the random 
procedure sketched in Sect. 4 has these properties. (The present ¢ and 6 are 
not those used earlier). Of course for the expected size, say yz, of a random 
matching drawn from a distribution satisfying (a), we have p =. n/k. This 
(letting « + 0) yields Theorem 7, and says that matchings of the desired size 
are plentiful in some sense. 

The proof of Theorem 10 is in the same vein as that of Theorem 7. Rather 
than cover vertices by edges, we must cover edges by matchings. Again we 
proceed in stages, choosing at each stage enough random matchings to cover 
a small but constant fraction of the surviving edges. Here, in contrast to the 
earlier situation, it is far from obvious what ought to be meant by “random 
matchings”; but matchings drawn from distributions as in Theorem 11 work 
very nicely (except that we should relax D-regularity in the theorem to 
something like “d(x) =; D Vr € V”). 

The point of (b)—I think this the nicest new idea here—is that it 
supports use of the Lovasz local lemma ((33] or e.g. [4]) to say that our 
small sets of matchings have positive probability of being well-behaved at 
every vertex. (Here and again in Theorems 12 and 13, application of the 
local lemma requires much stronger concentration assertions than are given 
by Chebyshev’s inequality. These are achieved via martingales: the so-called 
“Azuma-Hoeffding” inequality ({7, 54], or e.g. [14, 75]) in the present instance, 
and extensions thereof for the later results.) 


List Colorings 


The final (for now; see Conjecture 7) development in the direction we’ve 
been discussing was an extension of Theorem 10 to list-colorings which was 
conjectured in [58] and proved in [62]. 

Recall that the list-chromatic index, x}(H), of H is the least t such that 
if S(A) is a set (“list”) of size t for each A € H, then there exists a proper 
coloring o of H with a(A) € S(A) for each A € H. Of course x} is always at 
least x’, so Theorem 10 is contained in 


Theorem 12 ([62]). Let k be fixed and H a k-bounded hypergraph of 
maximum degree D satisfying (11). Then 


x1(H) ~ D (D — ov). 


List-colorings have recently been getting a lot of attention. Here we just 
want to give enough background on list-chromatic indices of graphs to put 
the graphic case of Theorem 12 in context. But see [3] for a survey of recent 
results, and, e.g., [47, 48, 62] for more on what’s touched on here. Let us also 
mention that we are, as usual, in Erdos territory: the study of list colorings 
was initiated by Vizing in [92] and, independently, Erdés, Rubin and Taylor 
in [34]. 
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The following central problem, now called the “list-chromatic” or “list 
coloring” conjecture, seems to have been proposed several times, probably 
first by Vizing in 1975 (see, e.g., [22, 47, 48] for more on this story). 


Conjecture 4. For every multigraph G, x}(G) = x/(G). 


The case G = K,,,, was proposed by J. Dinitz in about 1978 (see [28]) 
in the context of Latin squares. This version is particularly appealing, and 
seems to have provided much of the initial stimulus for western interest in 
such questions. 


Conjecture 5 (Dinitz Conjecture). Suppose that for 1 < i,j <n, Si; 
is a set of size n. Then there is a partial Latin square (8i,;)1<i,j<n with 
Sif = S55 for all a, 9. 


(A partial Latin square of order n is an n x n array of symbols with the 
property that no symbol appears more than once in any row or column.) 

Let us just quickly mention that there are natural extensions of these 
problems to vector spaces and matroids. For example, the following was 
proposed in [62] as a common generalization of the Dinitz Conjecture and 
a conjecture of G.-C. Rota [55] (see [55, 62] for more in this direction). 


Conjecture 6. Let V be an n-dimensional vector space and suppose that for 
1l<i,gj <n, Si3 ts a basis of V. Then there exist s;,; € Si; for 1 <it,7 <n 
such that each of the sets {s;,;:j =1,...,n}, {5i,; :t=1,...,n} ts a basis 
of V. 


For simple G, Conjecture 4 together with Vizing’s Theorem would imply 
that D(G) < x,(G) < D(G) +1, while Theorem 12 says that D(G) is at 
least the right asymptotic value. This improved several earlier bounds (e.g., 
[15, 16, 22]), the best of which was 


xi(G) < 7D(G)/4 + o(D(G)) 


due to Bollobas and Hind [16]. 

We haven’t had space in this very brief summary to discuss a beautiful 
algebraic approach to list colorings which was introduced by Alon and Tarsi 
in [5] and has had several important consequences ([27, 37] or [3]). Recently 
J. Janssen [56] used this approach to give a simple and elegant proof that 
X1(Kninti) = n+ 1 (which in particular says that the Dinitz Conjecture is 
not off by more than 1), and then Haggkvist and Janssen [48], taking [56] as 
a starting point, showed, inter alia, 


xXj(G) < D+ O(D?/? log D) (17) 


for simple bipartite G of maximum degree D. At this writing they can also 
show x}(K,) < n+ 1, and expect that their method will extend to give (17) 
for nonbipartite graphs. 
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Semirandom Again 


Theorem 12 was conjectured in [58], where a much more limited extension 
of Theorem 10 was used to prove Theorem 3. (Derivation of Theorem 3 from 
Theorem 12 is left as a nice exercise for the reader; or see [60].) While it was 
nice to see the asymptotic correctness of Conjecture 1, it’s my (perhaps not 
majority) opinion that the most important thing to come out of [58] was the 
right conjecture along these lines, namely what became Theorem 12. 

The special case of Theorem 12 proved in [58] requires a good under- 
standing of [77] but not too much in the way of new ideas. Theorem 12, on 
the other hand, for some time seemed beyond reach, an opinion influenced in 
part by the apparent difficulty of even the graphic case, and in part by the 
fact that the Pippenger-Spencer proof clearly would not extend. 

The basic idea of the eventual proof is actually quite natural, though a 
little strange in that it initially seems doomed to failure. Here’s a thumbnail 
sketch in the “standard” case that all the S(A)’s are the same. (The general 
case is not essentially different, but involves some fiddling to keep the relevant 
parameters on track.) 

We color the hypergraph in stages. At each stage we tentatively assign 
each as yet uncolored edge A a random color from its current list of legal 
colors. In some (most) cases, the color tentatively assigned to A will also be 
assigned to one or more edges meeting A. Such edges A are simply returned to 
the pool of uncolored edges. The remaining edges (those not involved in such 
“collisions” ) are permanently colored with their tentative colors and removed 
from the hypergraph. We then modify the lists of legal colors (mainly meaning 
that we delete from S(A) all colors already assigned to edges which meet A) 
and repeat the process. 

Martingale Concentration results together with the Lovasz local lemma 
are used to show that this procedure can be repeated many times, leaving 
after each stage a hypergraph and modified lists, of legal colors which are 
reasonably well-behaved. (Finding the correct definition of “well-behaved” is 
crucial.) Eventually our control here does deteriorate, but by the time this 
happens the degrees in the remaining hypergraph are small relative to the 
(minimum) number of colors still admissible at an edge, and the remaining 
edges can be colored greedily. 

The strange feature alluded to above is that the lists of legal colors initially 
shrink much faster than the degrees. (Roughly, when the degrees have shrunk 
to BD, with 8 not too small, the lists will have size about 3" D if edges are of 
size k.) This at first seems unpromising, since we are accustomed to thinking 
of degree as a trivial lower bound on chromatic index. What saves us here— 
this is perhaps the central idea of the proof—is that the lists $*(A) (of legal 
colors for A at the end of stage i) tend to evolve fairly independently, except 
where obviously dependent. So for example, for a color y which through stage 
i has not been permanently assigned to any edge meeting AM B (that is, we 
condition on this being so), the probability that ~ belongs to S*(B) is not 
much affected by its membership or nonmembership in $*(A). 
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Implementation of this idea is reasonably delicate; still, I think the 
proof demonstrates considerable flexibility for the “guided-random” approach 
(beyond what was already apparent from the results discussed above), and 
expect to see further applications in the near future. 

For example, about a year ago, J. H. Kim [69] used a similar method 
to make significant progress on Vizing’s old problem [91] of upper bounds 
for the chromatic number of a triangle-free graph G of maximum degree D, 
proving 


Theorem 13. [If G is a graph of maximum degree D and girth at least 5, 
then xi(G) < (1+ 0(1))D/ log D. 


(with the list-chromatic number \; defined in the obvious way). Here even 
for large girth the best previous upper bound was about D/2 due to 
Kostochka [71], though it seems reasonable to expect, particularly in view 
of [1, 2, 86], that the the bound of Theorem 13 remains valid for triangle- 
free G. 

(Recently, R. Haggkvist told me that A. Johansson and S. McGuiness, 
had just (independently) proved Kim’s result—following the method of 
Theorem 12 as described in [60|—and believed that for girth 4 they could 
show x(G) = O(D/ log D) and yi(G) = o(D).) 


A Conjecture 


Before closing this section, we mention one more (important) problem which 
recalls the “fractional vs. integer” theme of Sect. 5. 


Conjecture 7. For fired k and k-bounded hypergraph H, 
Xi) CH) ow CH): 
This goes far beyond Theorem 12, giving in effect—LP’s being regarded 
as “tractable” problems—a complete understanding of the asymptotics of 
chromatic and list-chromatic indices of k-bounded hypergraphs, even if we 


abandon assumptions such as (11) entirely. Note it contains Theorem 12 via 
Theorem 11, since the existence of a distribution p on M = M(H) satisfying 


S“{p(M):A4EMeEM}~1/D WAEH 


is the same as y/*(H) ~ D. 
Even the very special case of Conjecture 7, 


for multigraphs G, y/(G) ~ y"*, (18) 


is open, and of considerable interest. (For multigraphs y’* is given by 
Edmonds’ Matching Polytope Theorem ([26] or, e.g., [82]). More precise 
versions of (18) were proposed by Goldberg (see [89]), Andersen [6], Seymour 
[83] and again Goldberg [45]. The most important results on chromatic indices 
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of multigraphs are those of Shannon [85] and Vizing [90]; see also [35]. For x; 
of a multigraph G, the current upper bound is 9D(G)/5, due to Hind [52].) 
For one way of attacking (18) see Question 1 below. 


7. Normal Distributions 


As mentioned earlier, a central role in the proof of Theorem 9 is played by 
so-called “normal” distributions on the set of matchings of a graph. In this 
section we say what these are and try to give some idea of what they have to 
do with Theorem 9. Then in Sect.8 we describe some recent results for the 
special case of uniform distribution. 

Let G = (V, FE) be a graph and M = M(G) the set of matchings of G. 
For M € M, v € V, we write v < M if v is contained in some edge of M. 

A normal distribution on M is a probability distribution p = p) derived 
from some \: E —> R* according to 


w(M) = II AA, 
AEM 
p(M) = w(M)/ S> w(M’). 
M’EM 


For p a probability distribution on M, M € M chosen according to p and 
F, € E, set pr,,...p, = Pr(fi,...,F: € M). We call the probabilities pr for 
F € E the marginals of p. 

Let f : E + R*. Edmonds’ Matching Polytope Theorem says (though 
not in this language) that there is a probability distribution on M with 
marginals f iff 


SO f(F)<1 Wwev (19) 
F3v 
and 
SF): FCW} <||WI/2] WW. (20) 


The matching polytope, MP(G), is the set of such f’s. For normal distri- 
butions the analogous characterization was observed by Rabinovich, Sinclair 
and Wigderson [78]: 


Theorem 14. There exists a normal distribution with marginals f if and 
only if the inequalities (19) and (20) are strict for allu EV andW CV. 


We say that distribution p on M has the property Fd(6) if its marginal 
distribution f is in (1 — 6)MP(G), or equivalently, if the inequalities (19) 
and (20) hold even when their right hand sides are multiplied by (1 — 4). 
A crucial ingredient of the proof of Theorem 9 is a somewhat more technical 
version of 
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Lemma 1. For all 6, « > 0, andl there exists D such that if p has Ed(0) 
and if F,,...,F, € E are pairwise at distance at least D in G, then 


Thus, requiring that the marginals of p stay well away from the boundary of 
MP(G) guarantees a fair amount of (approximate) independence among the 
events {F € M}. 

Theorem 9 is proved by applying Theorem 8 to a sort of contraction of a 
randomly generated subhypergraph of the given hypergraph H. To give some 
idea of the relevance of Lemma 1, we make some simplifying, and slightly 
vague, assumptions. (The actual proof uses some preliminary reductions to 
arrive at similar, but somewhat weaker assumptions.) 

Suppose [ is a set of pairs—thought of as a graph—from V such that 
max{t({x, y}) : {x,y} € T} is small, and such that each A € H is the union 
of some collection of edges of IT’, called the parts of A, which are pairwise 
far apart in T. Under the latter assumption, the restriction é|p is a fractional 
tiling; moreover, it’s not hard to see that if b(t) is large enough then f := 
(1—V)t|r € (1—6)MP(L) for appropriate J, 5, both small positive constants. 

Write F < A if F is a part of A. Let p be the normal distribution with 
marginals f (as shown in [78], p is unique), and let M be chosen according to 
p. By the preceding remark, Lemma 1 applies to p. Define a new hypergraph 
H* on vertex set V*, and t* : H* > R* by 


V* ={F*: Fe M}, 
H* = {A*: AEH, all parts of A are in M} 
(with F* € A* iff F < A), and 


t*(A*) = [J f(y tA) VAT HT. 
FRA 


We prove Theorem 9 by showing that typically (using “~” and “<” only 
qualitatively in what follows) 


p(H) S p(X") S t'(H") = tH). 


Note for example that if the various events {F € M} were mutually 
independent, then we’d have E[t*(H*)] = t(#). 

Let us just say a little about the middle inequality, which is the heart of 
the matter. To prove it, we intend to apply a mild generalization of Theorem 8 
to the pair (H*,¢*). The main point (of the whole business) is to show that 
t* is (usually and in an appropriate sense) an approximate fractional tiling 
of H*. This goes roughly as follows. 

Suppose we condition on {F € M} (for some F € I) and consider 
t*(F*) = \o{t*(A*) : F* € A* © H*}. This has (conditional) expectation 
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So Pr(A* eH*|F eM) [I f(@ tA) & f(F)| SY H(A) 


A>F G<A A>F 
= f(F) MF) = (1-9) 1, 
since according to Lemma 1 and our simplifying assumptions, 


Pr(A* EH"|F €M)=Pr(GeEMVFAG<A\FeM)~ |] f£@. 
FAGRA 


Moreover—we leave the reader to ponder this nicest point—Lemma | enables 
us to use Chebyshev’s inequality to show that ¢*(F*) (again conditioned on 
{F € M}) is usually close to its mean. (There’s a slight lie here: we should 
add an assumption to the effect that for {x, y} € T, t({, y}) is not too small.) 

Thus, typically, #*(F*) will be close to 1 for most F* € V*, which is 
basically what we want. 


8. Random Matchings 


For the rest of our discussion we consider uniform distribution on M(G) 
(so the normal distribution corresponding to \ = 1). The main results here, 
Theorems 17 and 18, are surprisingly strong, and show that the distributions 
in question are in some respects much nicer than seems to have been 
previously realized. 

The material of this section is a little tangential to the topics of Sects. 2-6, 
but I include it here for two reasons. First, I do think of it as growing out 
of the earlier work. Second, understanding the extent to which the results 
described here extend to k-bounded hypergraphs would greatly enhance our 
understanding of these objects, and would in some cases (see in particular 
Conjecture 8 and Question 1) have specific consequences for questions 
considered above. 

Let G be a graph and let M be drawn uniformly at random from M(G). 
Mainly following [74, p. 341], we set € = €(G) = |M| and p;,(G) = Pr(é =k), 
and let « = (G) and o = o(G) denote respectively the mean and standard 
deviation of €. 

In what follows we deal with a sequence {G,,} of (simple) graphs. We 
abbreviate €(G,,), w(Gn) and o(G,,) to En, fn and op, and in addition, set 
IV(Gn)| = Un, |E(Gn)| = en, D(Gn) = Dy and v(Gn) = Yn. To avoid 
trivialities we always assume v,, + 00 (n > 00). 

In 1981 C. Godsil [43] gave sufficient conditions for asymptotic normality 
of the distributions {p;(G,,)}: 


Theorem 15. If D?/e,, — 0 then the distribution {px(Gn)}k>0 of matching 
sizes in Gy is asymptotically normal. 
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That is, for each x € R 


En — Pn 1 [ —t?/2 
P < > e dt (no). 
r( = x) oa ( ) 


The same conclusion was obtained by Rucirtiski [81] under a weaker 
hypothesis: 


Theorem 16. If ¥,/Dn — oo then {px(Gn)}e>0 is asymptotically normal. 
The main result of [63] is a necessary and sufficient condition: 


Theorem 17. The distribution {px(Gn)}x>0 is asymptotically normal if and 
only if 
Un — [ln —> 00 (n + Ov). (21) 


Let p(G,x) denote the probability generating function of the sequence 
{pk(G)}, 
u(G) 


p(G,x) = » pr(G)a*. 
k=0 


A fundamental fact, proved in [51] (see also [50]) and [72], is 
for every G, p(G, x) has real roots. (22) 


The significance of this for our discussion was first noticed by L. Harper [49] 
in his proof of asymptotic normality of the sequence {S(n,k)/Bn}x>1 (with 
S(n,k) the Stirling number of the second kind and B,, the Bell number). 
Harper’s neat observation is that, given (22), a necessary and sufficient 
condition for asymptotic normality of {px(Gn)}x>0 is that 


On > ©. (23) 


Thus Godsil and Ruciriski just need to prove (23) under their respective 
hypotheses (in both cases the key is the fact, shown in [51], that if 
p(G, xo) = 0, then |ao| > 4(D(G) — 1)), while proving Theorem 17 amounts 
to bounding vy — 1 as a function of o. (The current proof gives v — pu = O(0°); 
curiously, one can have v — pas large as Q(n®), which seems likely to be the 
truth.) 

Having said this, let us stress that, as illustrated by the next result, 
Theorem 17 is by no means a matter of replacing one unverifiable condition 
by another. (We write 6, for the minimum degree of G,.) 


Corollary 3. Each of the following implies asymptotic normality. 


(a) Vn/Dyn > 
(b) bn = (1 — 0(1))Dn 
(c) Up > (A = o(1))vn /2 


(The reader might try to see why each of (a)—(c) implies (21).) 
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Thus we recover Theorems 15 and 16, and for example have asymptotic 
normality for any sequence of regular graphs—note Theorems 15 and 16 do 
not apply to sequences of regular graphs for which degree grows in proportion 
to the number of vertices—or graphs with perfect matchings. The latter 
includes Harper’s theorem (again, see [43] or [79, p. 213] for the connection). 
Slightly unbalanced complete bipartite graphs show that (c) can’t be relaxed 
to “Vv, > (1 —€)vp,/2” if e > 0 is fixed. 

We don’t have space to say much about the proof of Theorem 17. 
As of now it is not very easy, though there are some special cases—e.g., 
sequences of regular graphs or sequences as in Theorems 15 and 16—for which 
the methods of [63] give fairly simple proofs. Though there’s no concrete 
connection between these methods and the guided-random approach of earlier 
sections, they do have in common a reliance on having a lot of approximate 
independence in the relevant probability spaces. For the earlier results, this 
independence is usually derived from something like (11). For Theorem 17, 
and also for Theorem 9 (see Lemma 1), the required independence derives 
from the following simple fact. 

Let M be a random matching drawn from a normal distribution on a 
graph G, and for vertex x, set p(x) = Pr(a ~ M). Forx €V,W C V(G), 
set p(W) = pa(W) = Cyew pw) and p(W|z) = po—2(W \ {e}). (Thus 
u(W) is the expected number of vertices of W covered by M, while w(W|z) 
is the same number conditioned on {x £ M}.) 


Lemma 2. Ifa ¢ W, then |u(W) - u(W|z)| < p(x). 


In particular, conditioning on {x A M} changes p(W) by at most 1. 

As noted above, Theorem 17 provides the first proof of (23) for sequences 
of regular graphs. In fact, as shown even more recently in [67], the values of 
and o? for a regular graph are remarkably well determined just by degree 
and number of vertices: 


Theorem 18. For any d-regular simple graph G, 


(a) v(G) — 2u(G) ~ o(G)/Vva, 
(b) 07(G) ~ o(G)/(4Vd) 


(limits taken as d > co). 
Actually (a) is a consequence of the finer 
p(Z) = Pr(xw & M)~d-/? Wx € V(G). 


(It’s worth stressing once more the use of the uniformity convention: the rates 
of convergence in the last three assertions depend on nothing but d.) 

The proof of Theorem 18 is again based in part on martingale concen- 
tration results, in this case for martingales related to random self-avoiding 
walks on the graphs in question. (The connection takes a little too long to 
discuss here, but see [44] for an indication of the relation between matchings 
and self-avoiding walks.) 
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Let us close with a conjecture and a question which take us back to 
some of the topics discussed earlier. (Though we won’t pursue the subject 
here, it would also be of considerable interest to understand to what extent 
Theorem 17 continues to hold for k-bounded hypergraphs; see [63] for some 
speculation as to what might be true in this direction.) 


Conjecture 8. For fired k and simple, k-uniform, D-regular H, 
p(o)~D-VF We V(H). 


If we relax “simple” to (11), then the conclusion should be p(%) — 0, 
which, like Theorem 11, would say that the value of v predicted by Theorem 7 
is actually the average size of an appropriately defined random matching. 
Given the sophistication of the distribution of Theorem 11 (and the difficulty 
of Theorem 7 itself), it would be extremely interesting if uniform distribution 
accomplished the same thing. 

Finally, an affirmative answer to the following would imply (18) (in the 
same way that Theorem 11 implies Theorem 10), and would also be very 
interesting in its own right. 


Question 1. Is it true that for each €¢ > 0 there exist t and Do such that 
for every D > Do and D-regular multigraph G with x'*(G) = D, there is a 
probability distribution p on M = M(G) satisfying 


(a) pa=-.1/D VA€ E(G), and 
(b) For M chosen according to p, and A € H, the event {A € M} is 
independent of the events {{B € M}: A(A, B) >t}? 


Added in Proof 


Conjecture 7 for bipartite multigraphs, so in particular the Dinitz Conjecture 
(Conjecture 5), was proved by Fred Galvin around the end of 1993 [F. Galvin, 
The list chromatic index of a bipartite multigraph, J. Combinatorial Th. (B) 
63 (1995), 153-158}. 

Anders Johansson [A. Johansson, An improved upper bound on the choice 
number for triangle free graphs, manuscript, 1994], again along the lines of 
the proof of Theorem 12, proved x;(G) = O(D/log D) for triangle-free G 
(compare Theorem 13). 

Another major application of the semirandom method discussed in 
Sects. 4 and 6 was given in [J.H. Kim, The Ramsey number R(3,t) has order 
of magnitude t?/logt, Random structures and Algorithms 7 (1995), 173-207]. 

Joel Spencer [J . Spencer, Asymptotic packing via a branching process, 
Random structures and Algorithms 7 (1995), 167-172.] showed that the 
“natural” proof suggested after Theorem 7 does indeed work. 
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A simpler proof of Theorem 8, based on the [77] proof of Theorem 7, was 
given in [J. Kahn, A linear programming perspective on the Frankl-Rédl- 
Pippenger Theorem, Random Structures and Algorithms 8 (1996), 149-157]. 

A proof of (18), based on normal distributions and the approximate 
independence results of [66], was given in [J. Kahn, Asymptotics of the 
chromatic index for multigraphs, J. Combinatorial Th. (B) 68 (1996), 
233-254]. 

The list-coloring version of (18) (so Conjecture 7 for multigraphs) was 
proved in [J. Kahn, Asymptotics of the list-chromatic index for multigraphs, 
Random Structures & Algorithms 17 (2000), 117-156]. 

Several of the above results, together with further applications of the 
ideas sketched in Sect.6, are discussed in detail in [M. Molloy and B. Reed, 
Graph Colouring and the Probabilistic Method, Springer, Berlin, 2002.] 
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1. Introduction 


The origins of the theory of random graphs are easy to pin down. Undoubtedly 
one should look at a sequence of eight papers co-authored by two great 
mathematicians: Paul Erdés and Alfred Rényi, published between 1959 and 
1968: 


ER59] On random graphs I, Publ. Math. Debrecen 6 (1959), 290-297. 

ER60] On the evolution of random graphs, Publ. Math. Inst. Hung. Acad. 
Sci. 5 (1960), 17-61. 

ER6la] On the evolution of random graphs, Bull. Inst. Internat. Statist. 38, 
343-347. 

ER61b] On the strength of connectedness of a random graph, Acta Math. 
Acad. Sci. Hungar. 12 (1961), 261-267. 

ER63] Asymmetric graphs, Acta Math. Acad. Sci. Hung. 14, 295-315. 

ER64] On random matrices, Publ. Math. Inst. Hung. Acad. Sci. 8 (1964), 
455-461. 

ER66] On the existence of a factor of degree one of a connected random 
graph, Acta Math. Acad. Sci. Hung. 17 (1986), 359-368. 

ER68] On random matrices I, Studia Sci. Math. Hung. 3 (1968), 459-464. 


Our main goal is to summarize the results, ideas and open problems 
contained in those contributions and to show how they influenced future 
research in random graphs. 

For us it was a great adventure to return to the roots of the theory of 
random graphs, and to find out again and again, how far-reaching the impact 
of Erdés and Rényi’s work on the field is. The reader will find in our paper 
many quotations from their original papers (always in italics). We use this 
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convention to let them speak directly and to preserve their special insightful 
style and way of thinking and stating the problems. Starting from there we 
lead the reader through the literature, including the most current one, trying 
to show how the ideas of Erdés and Rényi developed, how much time, skills 
and effort to solve some of their most challenging open problems was needed. 
Finally, to add some “salt and pepper” to our presentation, full of admiration 
and respect, we point out to a few false statements and oversimplifications 
of proofs, which have been found in their monumental legacy by the next 
generations of random graph theorists. 


2. The First Question: Connectivity 


Although the notion of a random graph appeared in connection to the 
probabilistic method already in the Erdés paper [25] (see J. Spencer’s article 
in this volume), it was forgotten for a decade until Paul Erdés and Alfred 
Rényi published a series of papers entirely devoted to properties of random 
graphs. The model of a random graph they exclusively investigated was the 
uniform one. Here is how they defined it: “Let E,,n denote the set of all 
graphs having n given labeled vertices and N edges. A random graph Tn.n 
can be defined as an element of En,n chosen at random, so that each of the 
elements of En,.n have the same probability to be chosen, namely 1/(@)” 
(In this paper we adopt the original notation T’,,1.) 

They were aware of existing results about other models of random graphs. 
In particular, they acknowledge in a footnote to [ER61a] that E. N. Gilbert 
[36] studied the connectedness of what we call today the binomial model, 
where “We may decide with respect to each of the (5) edges, whether they 
should form part of the random graph considered or not, the probability of 
including a given edge being p = N/(5) for each edge and the decisions 
concerning different edges being independent.” (In this paper we shall denote 
this model by Ty.) In [ER61la] they mention that the investigations of 
the binomial model can be reduced, due to a conditional argument they 
attribute to Hajek, to that of In,~. However, they did not formulate 
any equivalence theorem (these appeared much later in [14] and [59]) and 
occasionally stated the binomial counterparts of their theorems without 
proofs or repeated their proofs step by step. 

Apparently they were not aware of the result of Gilbert and of the bino- 
mial model at all when they wrote their first paper on random graphs, “On 
random graphs I”. The question addressed there was that of connectedness of 
a random graph. In fact, according to a remark in [ER59], this problem was 
tried and partially solved already in 1939, when P. Erdés and H. Whitney, 
in an unpublished work: “proved that if N > (5 + €) nlogn where « > 0 
then the probability of [n,n being connected tends to 1 if n > ov, but if 
N< (5 = €) nlogn with ¢ > 0 then the probability of [n,n being connected, 
tends to 0 if n > oo.” 
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In the first “official” paper on random graphs, Erdés and Rényi refined 
the above result as their (partial) answer to questions 1-3 from the following 
list of problems they posed. 


1. What is the probability of [,,n being completely connected? 

2. What is the probability that the greatest connected component (subgraph) 
of [nw should have effectively n — k points? (k =0,1,...) 

3. What is the probability that l,j should consist of exactly k +1 connected 
components? (k = 0,1,...) 

4. If the edges of a graph with n vertices are chosen successively so that after 
each step every edge which has not yet been chosen has the same probability 
to be chosen as the next, and if we continue this process until the graph 
becomes completely connected, what is the probability that the number of 
necessary steps v will be equal to a given number 1? 


Note that in problem 4 Erdés and Rényi describe a genuine random graph 
process, whose advanced analysis could be carried over only two decades later. 

Before turning to the proofs, they recall a recursive formula and a 
generating function for the number C(n, N) of connected graphs on n labeled 
vertices and with N edges, due to Riddell and Uhlenbeck, and also Gilbert. 
But immediately they comment that neither of them “...helps much to 
deduce the asymptotic properties of C(n, N). In the present paper we follow 
a more direct approach.” 

We now present the first result on random graphs and its proof in a 
slightly modified form. The idea of the proof, however, remains unchanged. 
In the 1959 paper only the middle part of the theorem below was stated 
explicitly. The other two follow by letting c = cn tend to +o0 or —oo, 
respectively. 


Theorem 2.1 ([ER59]). 


0 if % — 5 logn + —oo 
P(In,w is connected) >< e-@* if N- Llogn +c 


1 if % — Slogn + co. 


Proof. For convenience we switch to the binomial model, shortening the 
original argument a lot, and, at the same time, avoiding a harmless error 
in the proof of “the rather surprising Lemma” of [ER59], pointed out by 
Godehardt and Steinbach [37]. 

To make this argument formal, assume that 2np — logn — log logn — oo 
but np = O(logn). Thus, almost surely (i.e., with probability tending to 1 
as n —> oo), there are no isolated edges in T,,,. What remains to be shown 
is that there are no components of size 3 < k < 4% either. To this end 
consider the random variable X counting such components. Then, bounding 
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the probability that a given set of k vertices spans a connected subgraph by 


k*~2pk—!, and using the inequality np > $ log n, we obtain 
ae n en\k 
k-2,k—-1/4 _ »\k(n—k) k-2,,.k—-1,,—(n—k)pk 
Exp(x) < » GL p*—-(1—p) < > ( i ) kX “p"—*e Pp 


vn k n 
1 1 enp 1 1s enp \k 
< 
~ p » k2 (<hr) e D » n (<r) 
Va 


of log? n it 1 elogn ve (1) 
= = o(1). 
logn n3/2 logn \ 2n1/4 
Hence, almost surely there are no components outside the largest 
one other than isolated vertices (Erdés and Rényi say that such a graph 


is of type A) and the threshold for connectedness coincides with that for 
disappearance of isolated vertices, i-e., for 2np — logn — loglogn — oo 


P(In,p is connected ) = P(6(T'n,p) > 0) + 0(1). 


Erdés and Rényi found the limiting value of P(d([,,,) > 0) by inclusion- 
exclusion. Nowadays a standard approach is by the method of moments 
which serves to show that the number of isolates is asymptotically Poisson. 
They used that method in the 1960 paper in a more general setting where 
components isomorphic to a given graph G were considered. We shall return 
to this later. 

Answering question 4, they gave a somewhat oversimplified proof of the 
fact that 


— tn) ae 
lim P (7 wisest < c) =e ° : 


n—-co n 


Erdés and Rényi conclude the 1959 paper as follows. “The following more 
general question can be asked: Consider the random graph Ty n(n) with n 
possible vertices and N(n) edges. What is the distribution of the number of 
vertices of the greatest connected component of Pn, n(n) and the distribution of 
the number of its components? What is the typical structure of Vn n(n) (in the 
sense in which, according to our Lemma, the typical structure of Vy n(n) 18 
that it belongs to type A)? We have solved these problems in the present paper 
only in the case N(n) = 4nlogn-+cn. We shall return to the general case in 
another paper [8].” ({8] =[ER60] on our reference list.) 

As far as connectedness is concerned, in the 1961 paper Erdés and 
Rényi go on and find the threshold for r-connectivity of I, for every 
natural r. “If G is an arbitrary non-complete graph, let c)(G) denote the 
least number k such that by deleting k appropriately chosen vertices from 
G (...) the resulting graph is not connected. (...) Let ce(G) denote the 
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least number | such that by deleting | appropriately chosen edges from G 
the resulting graph is not connected.” A graph is r-connected if no removal 
of r or less vertices can disconnect it. When the random graph becomes 
almost surely r-connected? Theorem 2.1 revealed an interesting feature of 
random graphs. Namely, quite often trivial necessary conditions become 
asymptotically sufficient in the sense that for a typical, large graph their 
fulfillment guaranties that the property in question holds. Due to Theorem 2.1 
this is the case of connectedness versus the nonexistence of isolated vertices. 
For r-connectedness such natural necessary condition is that the minimum 
degree (denoted in [ER61b] by c(G)) must be at least r. Otherwise removing 
the vertices adjacent to a vertex of minimum degree would disconnect the 
graph. Erdés and Rényi showed in 1961 that in the range snlogn <N< 
nlogn this is the only way one can disconnect the random graph [,,y by 
removing the smallest possible number of vertices. A minimal cutset is a set 
of vertices whose removal makes the graph disconnected but no proper subset 
of that set has this property. For 2<k < not let Ax be the event that there 
is in T’,,y a minimal cutset of size s, 1 < s < r—1, which leaves the second 
largest component of size k. Arguing similarly as in the proof of Theorem 2.1, 
they proved that P(U,55 Ax) = o(1), meaning that, almost surely, if Py,~ 
is not r-connected then the only reason for that is the presence of vertices of 
degree less than r. The method of moments (again, in the inclusion-exclusion 
cover-up) gives that, for N(n) = $nlogn + Snloglogn + an + o(n), their 
number is asymptotically Poisson. We thus arrived at the main result of the 
1961 paper. 


Theorem 2.2 ([ER61la]). If we have 
1 r 
N(n) = grlogn -- gi log logn +an+ o(n) 


where a is a real constant and r a non-negative integer, then 


e724 
Jim, Plep(P.ve) =") = 1 ex (—), (3) 
further 
e724 
Jim P(ce(Un,n(n)) = 7) = 1— exp (- _ ) (4) 
and 
e724 
Jim P(c(Pn,n(n)) = 7) = 1 — exp (- a ) : (5) 


In a proceeding remark they promise: “The statement (5) of Theorem 2.2 
gives information about the minimal valency of points of n,n. In a forth- 
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coming note we shall deal with the same question for larger ranges of N 
(when c([',,~) tends to infinity with n), further with the related question about 
maximal valency of points of 1,7.” This promise was never fulfilled. The only 
trace of their interest in the vertex degrees of a random graph can be found in 
the description of the last phase of the evolution of T,, 7 in [ER61a]: “Phase 5. 
consists of the range N(n) ~ (nlogn)w(n) where w(n) > oo. In this range 
the whole graph is not only almost surely connected, but the orders of points 
are almost surely asymptotically equal. Thus the graph becomes in this phase 
‘asymptotically regular’.” The proof of that statement can be found in the 
last section of [ER60]. A very careful analysis of vertex degrees in a random 
graph is due to Bollobas [10, 11] and can be found also in his book [14]. 


3. Subgraphs: The Beginning of a Theory 


After having written their paper on connectivity of a random graph Erdés and 
Rényi decide to write a long paper addressing several properties of random 
graphs. That seminal paper was preceded by an extended abstract [ER61al, 
where they outlined the main goals of the theory to be born. “Our main goal 
is to show (...) that the evolution of a random graph shows very clear-cut 
features. The theorems we have proved belong to two classes. The theorems of 
the first class deal with the appearance of certain subgraphs (e.g., tress, cycles 
of a given order etc.) or components, or other local structural properties, and 
show that for many types of local structural properties A a definite ‘threshold’ 
A(n) can be given, so that if aa > 0 for n > co then the probability 
that the random graph T;, n(n) has the structural property A tends to 0 for 


n — oo, while for ae => oo for n > co the probability that the random 


graph Ty n(n) has the structural property A tends to 1 for n — oo. (...) 
The theorems of the second class are of similar type, only the properties A 
considered are not of a local character, but global properties of the graph 
Prin) (€-g-, connectivity, total number of components, etc.).” The existence 
of a threshold in all cases they considered was a rather surprising fact for 
Erdoés and Rényi. Only three decades later it was proved by Bollobds and 
Thomason [19] that, as a consequence of the Kruskal-Katona inequality, every 
monotone property (family) of random subsets of a set has a threshold in the 
above sense. 

In the same abstract they comment that their proofs are “... completely 
elementary, and are based on the asymptotic evaluation of combinatorial 
formulae and on some well-known general methods of probability theory ....” 

The first theorem of the major paper [ER60] established the threshold for 
the existence of a subgraph of a given type for a broad class of subgraphs. 


“Tfa graph has n vertices and N edges, we call the number 2N the ‘degree’ of 
t 2N 
n 


oo 


the graph (as a matter of fac is the average degree of the vertices of G.) 
If a graph G has the property that G has no subgraph having a larger degree 
than G itself, we call G a balanced graph.” 
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Theorem 3.1 ([ER60]). Let k > 2 andl (k-1<I1< (5) be positive 
integers. Let By, denote an arbitrary not empty class of connected balanced 
graphs consisting of k points and | edges. The threshold function for the 
property that the random graph considered should contain at least one 


subgraph isomorphic with some element of By. is ne-t, 


Among special cases they mention trees, connected unicyclic graphs, 
cycles, complete graphs and complete bipartite graphs all of which are 
balanced. Over 20 years later, Bollobds [9] generalized this theorem to 
arbitrary (not only balanced) graphs. He, however, used a rather complicated 
method. In 1985, to a great surprise to all involved, Ruciriski and Vince [73] 
found out that the original proof of Erdés and Rényi which was based on 
the second moment method can be easily adapted to cover all graphs as well. 
We now state that result in the binomial model. 


Theorem 3.2 ([9]). For an arbitrary graph G with at least one edge, 


lim P(G CTnp) = 


n> co 


0 ifp=o(n-"/me) 
1 ifn-V/™e = o(p), 
where mg = maxyce dy and dg = TaGIE 
A crucial role in the Rucinski-Vince proof of Theorem 3.2 is played by 
the quantity ®g = mingcg Exp(Xz) . In fact, the inequalities 


1-—@¢g< PIG ETnp) < a/fe 
obtained in that proof have been strengthened to exponential bounds 
eee <PiG lig) =e 


where the L-H-S follows by the FKG inequality and the R-H-S is a special 
case of a recent inequality from [42]. 

As far as the asymptotic distributions of subgraph counts are concerned, 
Erdés and Rényi treated in [ER60] only trees and cycles. For trees of order k 
they established a limiting Poisson distribution on the threshold N ~ cnt, 
They observed that the same result holds for isolated trees, since in this 
range almost surely all k-vertex trees are isolated (i.e., are components of the 
random graph). They also found another Poisson threshold for isolated trees 
at N= sz n log n- i n log log n+cn-+o(n), beyond which isolated trees die 
out (swallowed by the giant component on its way to absorb all the vertices 
of the random graph). They also established an asymptotic normality of the 
number of isolated trees of order k (after suitable standardization) in the 
whole range of N between the two thresholds. As observed by A. Barbour 
in [5], the proof given by Erdés and Rényi was not correct and in the range 
N ~ cn, c# 1/2, the standardization was not right. However, using another 
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method Barbour showed that indeed the asymptotic normality holds in the 
entire range in question. For cycles and isolated cycles they established 
a Poisson distribution (different in each case) at N ~ cn and observed 
that contrary to isolated trees, “...the probability that Tp, contains an 
isolated cycle of order k never approaches 1.” A similar result was proved for 
connected unicyclic graphs. All these results were obtained by the method 
of moments based on a fact from probability theory that for all distributions 
which are uniquely determined by their moments (Poisson and normal are 
such) the convergence of all moments of a sequence of random variables 
to the moments of that distribution implies convergence in distribution 
[8, Theorem 30.2]. Erdés and Rényi prove this fact as a lemma just for the 
Poisson distribution, although they use it also for the normal distribution. 
At the end of the paper, in a remark added in proof, they acknowledge that 
N. V. Smirnov proved this lemma already in 1939. 

They conclude their investigations of local properties of random graphs 
with the comment: “Similar results can be proved for other types of subgraphs, 
e.g., complete subgraphs of a given order. As however these results and their 
proofs have the same pattern as those given above we do not dwell on the 
subject any longer and pass to investigate global properties of the random 
graph Tl, n.” In 1979, K. Schiirger, a former Ph.D. student of Erdés, proved 
similar results for complete subgraphs [74] and a few years later Karoriski [47] 
extended them to so called k-trees, a common generalization of trees and 
complete graphs. All these particular cases led to a general result for all 
strictly balanced graphs. A graph is strictly balanced if every proper ee 
has its degree strictly smaller than the graph itself. Let us denote dg = wet} 
and recall that X@ is the number of copies of G in a random graph I; p. 
The following result was proved independently in [9] and [48]. 


Theorem 3.3 ([9, 48]). If G is a strictly balanced graph and np*?¢ > c > 0 
then Xq@ converges to the Poisson distribution with expectation Oe 


If a graph G is balanced but not strictly balanced then the limiting 
distribution of Xg on the threshold, i.e. when p = O(n~!/4¢), becomes quite 
involved. Although, in principle, as shown by Bollobds and Wierman [20], it 
can be computed, there is no nice closed formula. For example, when G is a 
disjoint union of 2 triangles then the limit distribution is that of the random 
variable ay where Y is Poisson. When G is the triangle with a pendant edge, 
the limit is ae Y;, where all random variables involved are independent 
and Poisson. When G is the triangle with two pendant edges hanging at 
the same vertex then X@ converges to the distribution of = Ca where 
again all random variables are independent Poisson. One more example: if 
G is the triangle with a path of length 2 hanging at one of it vertices, then 


U 
the limit distribution is that of yaa ’ Y;, where all random variables are 
independent Poisson. We can only hope that so far the reader is convinced 
that a pattern does indeed exist. 
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If G is nonbalanced, then the expectation of X¢ tends to infinity and one 
has to normalize. It turns out that there is a nonrandom sequence a,,(G') — oo 
such that the asymptotic distribution of a coincides with that of Xz, 
where H is the largest subgraph of G for which dy = mga. Clearly, H is 
balanced and we are back to the balanced case. The sequence a,(G) is equal 
to the expected number of extensions of a given copy of H to a copy of G in 
the random graph I’,,,,. For details see [71, page 292]. 

Beyond the threshold, i.e., when np”? — oo, Xq@ converges after 
standardization to the standard normal distribution as long as n?2(1—p) > oo. 
(For bigger p Xq is either Poisson or degenerate, according to the formula 
Xq~ (*) TG) —Cn(G)Z, where Z is the binomial random variable counting 
edges in the complement of T’,,, and cp(G) is the number of copies of G in K,, 
containing a fixed edge. For details see [70].) This result was supplemented 
by the rate of convergence in [7]. It was shown there that the total variation 
distance between standardized Xq and the standard normal distribution can 
be bounded by O( sss) as long as p # 1 and by Os) otherwise. Recall 
that ®¢ — oo if and only if np™? > oo. 

A variant of the small subgraph problem is one when we only count 
induced subgraphs of T,,, which are isomorphic to G (induced copies). 
Let Y¢ count them. Then, denoting v = |V(G)| and 1 = |E(G)|, Exp(Yce) = 
Exp(Xg)(1—p) 4 and as long as p + 0 there is no substantial difference in 
the limiting distribution of Xg and Yq. For p constant, however, interesting 
things may happen. First of all, in contrast to X@, the variance of Ye may 
drop below the order of n?’~?. It does so when Exp(I|Ji2) = Exp(1), ice., 
when p = 1/ or where J is the indicator of the event that there is an induced 
copy of G in T,,, on the vertex set {1,...,v} and Jj; is the indicator that 
the edge ij is present in T',,. But if Var(Yc) = O(n’) then still Ye is 
asymptotically normal, and only when the variance drops further down to 
the order of n?’~* the distribution of standardized Yg becomes nonnormal 
(the convolution of normal and xy? distributions). It is a purely combinatorial 
question when Var(Yg) = @(n?"~*). For the higher terms to cancel out one 
needs that Exp(I|Ji2, Ji3, Jo3) = Exp(I), or, equivalently, that in addition 
to p =1/($), the proportion t3 : t2 : t1 : to = p® : 3p?q : 3pq? : q? is satisfied, 
where t; is the number of induced subgraphs of G isomorphic to the graph 
with 3 vertices and i edges. For p = $, an example of a graph satisfying 
these requirements is the wheel on 8 vertices, i.e. the graph obtained from 
the 7-cycle by joining a new vertex to every vertex of the cycle. For some 
time it was an open question if such abnormal cases take place for every 
rational p. A positive answer to that puzzle is due to combined efforts of 
Janson, Kratochvil, Karrman and Spencer [41, 45, 49]. 

The random variables Xq@ and Yq are examples of sums of random 
variables with only few dependent summands. In particular, the summands 
forming Yq are dependent only if the sets corresponding to the indices 
intersect (on at least 2 vertices, in fact). The reason is that the property of 
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the vertex set we are after depends only on the presence and absence of the 
edges within the set. The situation changes when we move to the properties 
depending also on the pairs with one endpoint in the set. Then all summands 
are mutually dependent, but most just weakly. We have already encountered 
such a case when studying the number of components of I’, which are 
isomorphic to a given graph G. Clearly this property requires that there is 
no edge with one endpoint in the set of vertices of a copy of G. Another 
example of such “semi-induced” property is the notion of a maximal clique. 
This is a complete subgraph not contained in any bigger complete subgraph 
of a graph. For a vertex set to span a maximal clique one needs that no other 
vertex is adjacent to all the vertices of the set. In [6] the limiting distribution 
of the number of maximal k-cliques was investigated. It was proved that 
for k > 2 there are two Poisson thresholds for the existence of maximal k- 
cliques and the phase of asymptotic normality between them. Finally, there 
are characteristics which lead to sums of random variables indexed by vertex 
sets, which each depend on the presence or absence of all the edges in Typ. 
An example of this is the number of copies of G disjoint from all other copies 
of G in T’,,,,. Here even the expectation is difficult to obtain, and the limiting 
normal distribution is still beyond ones reach. 


4. Phase Transition 


Sections 4-9 of [ER60] are devoted to global properties of random graphs. 
The proofs follow the same pattern. First, the expectation of the quantity in 
question is asymptotically evaluated. Then, using Markov’s and Chebyshev’s 
inequality (the first and the second moment method, resp.) the asymptotics 
of the quantities themselves are derived. As a summary of these results we 
quote here how Erdos and Rényi characterize the process of the evolution 
of a random graph in the paper presented to the International Statistical 
Institute meeting in Tokyo in 1961 [ER61a]: 

“Tf n is fixed large positive integer and n is increasing from 1 to aF the 
evolution of [n,n passes through five clearly distinguishable phases. These 
phases correspond to ranges of growth of the number N of edges, these ranges 
being defined in terms of the number n of vertices. 


e Phase 1 corresponds to the range N(n) = o(n). For this phase it is 
characteristic that TV, n(n) consists almost surely (i.e. with probability 
tending to 1 as n + +00) exclusively of components which are trees. (...) 

e Phase 2 corresponds to the range N(n) ~ cn with0 <c < 1/2. (...) 
In this range almost surely all components of Tn,n(n) are either trees 
or components consisting of an equal number of edges and vertices, i.e. 
components containing exactly one cycle. (...) In this phase though not 
all, but still almost all (i.e. n — o(n)) vertices belong to components which 
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are trees. The mean number of components is n — N(n) + O(1), te. in 
this range by adding a new edge the number of components decreases by 1, 
except for the finite number of steps. 

e Phase 3 corresponds to the range N(n) ~ cn with c > 1/2. When N(n) 
passes the threshold n/2, the structure of Pn n(n) changes abruptly. As a 
matter of fact this sudden change of the structure of Pn n(n) 18 the most 
surprising fact discovered by the investigation of the evolution of random 
graphs. While for N(n) ~ cn with c < 1/2 the greatest component of 
Tn,N(n) is @ tree and has ( with probability tending to 1 as n + +00) 
approximately 4 (logn - 3 log log n) vertices, where a = 2c — log 2c, for 
N(n) ~ n/2 the greatest component has (with probability tending to 1 as 
n — +00) approximately n?/3 vertices and has rather complex structure. 
Moreover for N(n) ~ cn with ¢ > 1/2 the greatest component of Pn nn) 
has (with probability tending to 1 as n — +00) approximately G(c)n 
vertices, where 


(clearly G(1/2) =0 and lime++o0 G(c) = 1). 

Except this “giant” component, the other components are all relatively 
small, most of them being trees, the total number of vertices belonging to 
components, which are trees being almost surely n(1 — G(c)) + o(n) for 
eS 1/2.) 

The evolution of Vn n(n) in Phase 3. may be characterized by that the 
small components (most of which are trees) melt, each after another, into 
the giant component, the smaller components having the larger chance of 
“survival”; the survival time of a tree of order k which is present in TD, n(n) 
with N(n) ~ cn, ¢ > 1/2 ts approximately exponentially distributed with 
mean value n/2k. 

e Phase 4 corresponds to the range N(n) ~ cnlogn with c < 1/2. In this 
phase the graph almost surely becomes connected. (...) 

e Phase 5 consists of range N(n) ~ (nlogn)w(n) where w(n) > +00. 
In this range the whole graph is not only almost surely connected, but the 
orders of all points are almost surely asymptotically equal. Thus the graph 


m9 


becomes in this phase “asymptotically regular”. 


Erdés and Rényi in their fundamental paper [ER60] gave a fairly complete 
“big picture” of the evolution of a random graphs. However many fascinating 
questions were left unanswered. For example, how did the giant component 
grow so rapidly, what is the nature of the “double jump” of its size: from 
O(logn) when c < 1/2 to O(n?/%) when c = 1/2 and finally being of the 
order of n when c > 1/2? 
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Often we say that a random graph goes through the phase transition at 
c = 1/2 due to an obvious resemblance of this period of its evolution to the 
physical phenomena of changing the state, for example, from liquid to solid. 
Here a random graph changes abruptly its state from a loose collection of 
small components being trees and unicyclic to solid single giant component 
dominating its structure. 

The critical moment of the phase transition was unresolved until the 
milestone paper of Béla Bollobas [13] who revealed the mechanism of the 
formation of the giant component. He also focused the attention, for the first 
time, on the nature of the phase transition phenomena, investigating this 
critical moment of the evolution and looking at the beginning of so called 
supercritical phase. He asked what is the typical structure of a random graph 
Pn,v when N(n) = 5n+ , where s = o(n). In particular he proved that 
the largest component is almost surely unique once s > 2(logn)!/?n?/3 and 
its size Li([,,~) is approximately 4s while the size of the second largest 
component L2(T,,.) is much smaller. 

Bollobas gave a good lead to what we might consider as the proper 
magnification if we want to get undistorted picture of the phase transition 
while looking at the neighborhood of the “critical point” n/2. Due to later 
results of Luczak [58], combined with those of Kolchin [51], we know that the 
correct parametrization is 


1 
N(n) = 5” + An?/3, 


When A — —oo then [';,y consists of many components of the same 
size as the largest one, which is still very small and consists roughly of 
ce log(s?/n?) vertices, and the large components are unable to “swallow” 
each other and therefore are forced to hunt for smaller query. Hence large 
components grow absorbing only small ones and no clear favorite to win the 
race for the giant emerges. As the number of edges N(n) increases, the number 
of contestants decreases. When A = constant < 0 the probability that two 
specified large components will form a new component is bounded away from 
zero, but still too small to ensure the creation of unique giant component. At 
the same time, a big gap between the orders of large and small components 
arises which prevents the creation of new large components from the small 
ones. Next, as soon as A — oo, all large components almost “instantly” merge 
together and a unique large component emerges. This component is still not 
giant, it has barely over n?/3 vertices, but it will continue to absorb other 
components, first the largest ones, rapidly becoming giant. 

The next result of Luczak [58] gives a clear picture of the sizes L;([n,~) 
of the ith largest components during the phase transition of I’, y. Here and 
throughout the paper the abbreviation a.s. stands for ‘almost surely’, a phrase 
whose precise meaning was explained in the description of Phase 1. above. 
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Theorem 4.1 ([58]). Let k be natural number and sn-?/3 —+ 00 but s = 
o(n). 
(i) If N=n/2—<s then for everyi=1,2,...,k and every real r 


: n? e 45 Ce a _y 
Jim P (2:(0..v) < 352 (108 5 - 3 slog +r)) a » rik : 


where X = X(r) = 2/\/me~". 
Moreover, a.s. the ith largest component of [n,n is a tree fori = 
1,2,...,k andI,, n contains no component with more edges than vertices. 
(ii) Let N = n/2+5 and let s' be the unique positive solution of the equation 


2s’ s! 2s’ 2s! 
(.- =) j= (1+ =) ae 
n mr 


Then a.s. 
2(s+s')n n 
In(Un.nw) - — 
| 1,9) — SE) cot 
and so 
A 2 
Moreover, for every 1 =,2,...,k and every real r 
; n? se 5 3° oe i 
Jim P (4:(0..v) < 362 (108 55 - 3 slog +r)) — -» re : 


j=0 


where \ = \(r) = 2/./ne7". 

Furthermore a.s. the ith largest component of Tn, 1 = 2,3,...,k, 
is a tree and no component of n,n, except for the largest one, contains 
more edges than vertices. 


To study the critical “interval” when the phase transition takes place, 
i.e., when N(n) = $n + An?/3 and \ — oo, requires very sophisticated and 
delicate tools. Janson, Knuth, Luczak and Pittel in their extensive, almost 
140 pages long, study [40] applied machinery of generating functions with 
great success. They were able to analyze the structure of evolving graphs 
(and multigraphs) when edges are added one at a time and at random, with 
great precision, mainly looking and so called excess and deficiency of a graph. 
To give the reader a taste of their results let us quote the following theorem. 
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Theorem 4.2 ([40]). The probability that a random graph or multigraph 
with n vertices and $n + O(n'/3) edges has exactly r bicyclic components 
(i.e., components with exactly two cycles), and no components of higher cyclic 


order, 1s 
a 72 4 1/3 
(5) io eS 


They also study the following fascinating problem: What is the 
probability that the component which during the evolution becomes the first 
“complex” component (i.e., the first component with more than one cycle) 
is the only complex component which emerges during the whole process? 
So they ask what is the probability that the first bicyclic component is the 
“seed” for the giant one. They prove that it happens quite often indeed. 


Theorem 4.3 ({40]). The probability that an evolving graph or multigraph 
on n vertices never has more than one complex component throughout its 


: BT nw 
evolution approaches + ~ 0.8727 as n — ov. 


5. Planarity and Chromatic Number 


In a paper of such an enormous length one can likely find less rigorous claims. 
One of such things happened in the paper [ER60] in relation to the question 
when a random graph I, is planar. 

Since trees and components with exactly one cycle are planar, Erd6s and 
Rényi easily deduced from their findings about early stages of the evolution 
of a random graph, that when c < 1/2 then the probability that [1 is 
planar tends to 1. Now, to support the claim that when c passes 1/2 the 
graph becomes non-planar they used the argument that T,,, contains an 
induced cycle with d diagonals. Although their claim (Theorem 8a on page 
51) regarding the distribution of the number of such cycles is incorrect, as 
it was pointed out later by Luczak and Wierman [63], their intuition was 
perfect and the following result is indeed true. 


Theorem 5.1 ([63]). Let us suppose that N ~ cn. If c < 1/2 the probability 
that the graph Ty,n ts planar is tending to 1 while for c > 1/2 this probability 
tends to 0. 


Such a behavior of a random graph shows the fundamental difference 
in its typical structure before and after the phase transition. Now, thanks 
to the contribution of Luczak, Pittel and Wierman [64], we have more 
detailed knowledge about planarity of a random graph, also during the phase 
transition. 


Theorem 5.2 ([64]). Let € = e(n) +0 asn—- oo. Then Ty» is: 
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(i) a.s. planar, when p = (1 — €)/n, en > oo; 
(it) Planar with probability tending to a(A), 0 < a(A) < 1, as n > oo, when 
p=(1+e)/n, where en — and —c0 < X < 00 is a constant; 
(iii) a.s. non-planar, when p = (1 + €)/n,e8n > oo. 


In the final section of the paper [ER60] Erdés and Rényi collected 
unsolved problems. One of them is closely related to planarity: Another in- 
teresting question is: what is the threshold for the appearance of a “topological 
complete graph of order k”, i.e., of k points such that any two of them can 
be connected by a path and these paths do not intersect. For k > 4 we do 
not know the solution. The solution was found many years later by Ajtai, 
Kémlos, and Szemerédi [2]. 

Another problem mentioned there turned out to be one of the central 
and most challenging questions of the theory. Erdés and Rényi asked ” what 
will be the chromatic number of [n,n ?” What they knew then about this 
important graph invariant was limited to facts which can be deduced from 
general results regarding the evolutionary process. Here is what they were 
able to conclude : “Clearly every tree can be colored by 2 colors, and thus 
by Theorem 4a almost surely Ch(Tn,w) = 2 if N(n) = o(n). As however 
the chromatic number of a graph having an equal number of vertices and 
edges is equal to 2 or 3 according whether the only cycle contained in such 
graph is of even or odd order, it follows from Theorem 5e that almost surely 
Ch(T n,n) < 3 for N(n) ~ ne with c < 1/2. For N(n) ~ n/2 we have 
almost surely Ch([n,w) > 3. As a matter of fact, in the same way, as we 
proved Theorem 5b, one can prove that n,n contains for N(n) ~ n/2 almost 
surely a cycle of odd order. It is an open problem how large Ch(Tn,n) is for 
N(n) ~ n/2 with ec > 1/2.” 

This question remained open for next 30 years, and was answered, for 
large c, by Luczak in [57]. He proved that the chromatic number x(Ix,p) 
behaves as follows. 


Theorem 5.3 ([57]). Let np =c and e > 0 be fixed. Suppose c. < c+ o0(n) 
for sufficiently large constant c.. Then 


Cc 
P Pre) <1 
(Soe < xP nn) <9 


Although the original question was posed for sparse random graphs 
the ideas leading to the proof came from investigations of the chromatic 
number of dense random graphs. The first step toward the solution was 
made by Matula [66, 67] and Bollobds and Erdés [16] who discovered high 
concentration of the size of the largest independent set in I’, , around 2 log, n, 
where b = 1/(1—p) and edge probability p is a constant. It suggested that the 
respective lower bound for y(I',,,) should be n/(2 log, n). Only a few years 
later, Grimmett and McDiarmid published a paper [38] in which they showed 
that a greedy algorithm, which assigns colors to vertices of a random graph 


386 Michat Karoriski and Andrzej Ruciriski 


sequentially, in such a way that a vertex gets the first available color, needs, 
with high probability, approximately n/log,n colors to produce a proper 
coloring of I’,,,. It established an upper bound for the chromatic number 
of dense random graph, twice as large as the lower bound. Grimmett and 
McDiarmid conjectured that the lower bound sets, in fact, the correct order of 
magnitude for y(I',,,). The right tool to settle this conjecture was delivered 
by Shamir and Spencer [76]. They proved that the chromatic number of 
I, is sharply concentrated in an interval of length of order n'/? but, what 
perhaps was more important then their result itself, they introduced to the 
theory of random graphs a new powerful technique based on concentration 
measure of martingales, known in the probabilistic literature as Hoeffding- 
Azuma inequality. But it was Béla Bollobas who showed how the potential of 
martingale approach can be utilized to solve long standing conjecture. In his 
paper [15] he proved the following theorem. 


Theorem 5.4 ({15]). Let 0 < p < 1 be fixed and b = 1/(1—p). Then for 
every € > 0 


n 
>1loasn7>ow. 


<x(En.p) < (1+ 6) 


2 log, n 2 log, 7 


Later on Matula and Kucera [68] gave an alternative proof of the above 
theorem, using the second moment and “expose and merge” algorithmic 
approach. Luczak’s proof of Theorem 5.3 is in fact an ingenious blend of 
the martingale and “expose and merge” techniques. 

The chromatic number of a random graph is a random variable, the 
distribution of which should be highly concentrated. It is easy to notice 
(see above) that if p= o(n~') then y([n,») is 2 (not counting the case when 
the edge probability is of the order smaller then n~? and therefore, with high 
probability the graph is empty). One can also show that when p ~ cn“1, 
O<c<1 then P(y(Tn,») = 2) > a and P(x(Tnp) = 3) — 1-4, where 
a = e*/?((1—c)/(1+c))'/4. The last probabilities are simply the same as the 
probabilities that I’, has or does not have an odd cycle. Such a behavior of 
a random variable y has been confirmed, for small edge probabilities only, by 
Luczak. He proved in [61] that if p < n~5/®~€ then the chromatic number, as 
expected, takes on at most two values. 


6. Asymmetric Graphs 


Another interesting topic originated from a joint paper by Erdés and Rényi 
in the peak of their cooperation in early 1960s [ER63]. Here is how they 
describe their goals: “We shall call (...) a graph symmetric, if there exists 
a non-identical permutation of its vertices, which leaves the graph invariant. 
By other words, a graph is called symmetric if the group of its automorphisms 
has degree greater than 1. A graph which is not symmetric will be called 
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asymmetric. The degree of symmetry of a symmetric graph is evidently 
measured by the degree of its group of automorphisms. The question which 
led us to the results contained in the present paper is the following: how can 
we measure the degree of asymmetry of an asymmetric graph?” 

They answer the last question in what follows: “Evidently any asymmetric 
graph can be made symmetric by deleting certain of its edges and by adding 
certain new edges connecting its vertices. We shall call such a transformation 
of the graph its symmetrization. For each symmetrization of the graph let us 
take the sum of the number of deleted edges — say r — and the number of new 
edges — say s -; it is reasonable to define the degree of asymmetry A[G] of 
a graph G, as the minimum of r+ s where the minimum is taken over all 
possible symmetrizations of the graph G. (...) The question arises: how large 
can be the degree of asymmetry of a graph of order n (i.e., a graph which has 
n vertices)? We shall denote by A(n) the maximum of A[G] for all graphs G 
of order n(n = 2,3,...).” 

They first notice that A(2) = A(3) = A(4) = A(5) = 0 while A(6) = 
1. In general, a rather straightforward deterministic argument leads to the 
following result. 


Theorem 6.1 ([ER63]). 


To find the lower bound for A(n) Erdés and Rényi use a non-constructive 
argument, i.e., they show via the probabilistic method that there exists a 
certain graph on n vertices with the degree of asymmetry at least n(1—)/2, 
O<e<l. 


Theorem 6.2 ([ER63]). Let us choose at random a graph T having n given 


vertices so that all possible 2(3) graphs should have the same probability 
to be chosen. Let € > 0 be arbitrary. Let P,(e€) denote the probability that 
by changing not more than nt<) edges of T it can be transformed into a 
symmetric graph. Then we have 

lim P,(e) = 0. 

n—-+>Co 
Corollary 6.1. For any « with 0 < € < 1 there exists an integer no(e) 
depending only on €, such that for every n > no(e) there exists a graph G of 
order n with A[G] > n(1—«)/2. 


Indeed, for large n, Theorem 6.2 shows that almost every graph is a 
counterexample to the hypothesis that its symmetrization is possible with 
less than $(1 — o(1)) edges. 
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Hence, if we combine Theorem 6.1 and Corollary 6.1 we see that 


After showing that almost all labeled simple graphs are asymmetric, Erdés 
and Rényi turned their attention to graphs with a prescribed number of 
edges. First they noticed that since almost every tree has a cherry, i.e., a pair 
of pendant vertices adjacent to a common neighbor, therefore almost every 
tree on n vertices is symmetric. Furthermore they proved that any connected 
graph of order n having n edges is either symmetric or its asymmetry is one 
and gave the following bound. 


Theorem 6.3 ([ER63]). Jf a graph G of order n has N = An edges (0 < 
A < (n—1)/2) then 


A[G] < 4\ (1- =~). 


n—- 


Erdés and Rényi went further in their investigations. Let us quote a 
few more lines from their paper [ER63]. “Another interesting question is 
to investigate the asymmetry or symmetry of a graph for which not only the 
number of vertices but also the number of edges N is fixed, and to ask that 
if we choose one of these graphs at random, what is the probability of its 
being asymmetric. We have solved this question too, and have shown that if 
N = $(logn+w(n)), where w(n) tends arbitrarily slowly to +00 for n + +00, 
then the probability that a graph with n vertices and N edges chosen at random 


woe 
(so that any such graph has the same probability (()) to be chosen) should 
be asymmetric, tends to 1 for n + +o0. This and some further results will 
be published in another forthcoming paper.” 

Unfortunately the announced paper has never been published! Several 
years later this problem and the analogous one for unlabeled graphs was 
attacked again by Wright [78]. 

Consider graphs T’,,y and U;,n picked at random from the families of 
all labeled and unlabeled graphs on n vertices and with N = N(n) edges, 
respectively. Here is the result of Wright. 


Theorem 6.4 ([78]). Ifw(n) = (2N(n)/n)—logn — oo thenT yn and Un. 
are almost surely asymmetric while when w(n) < 0 then they are almost surely 
symmetric. 


Later Luczak [56] gave precise results about the structure of the 
automorphism group Aut([,,.) of a random graph T,,v. He studied the 
symmetry of the largest component L,(n,N) of this random graph. What 


he found was that when N(n) = na(n) then there exists a constant d such 
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that for a(n) > d almost surely Aut(Li(n, N) is isomorphic to some product 
of symmetric groups. From this result he was able to deduce the following 
strengthening of the “labelled” part of Theorem 6.4. 


Theorem 6.5 ([56]). Let N= $(logn+w(n)). 
(i) If w(n) > —oo then |Aut(Tn.n)| > 00 a.s. 
(ii) If wn) c then 


lim P(|Aut(Pn,v)| = 1) = e*(1+ 2) 


dM 
1 =k!) = — —x 
Jim P(|Aut(Tn,w)| = k!) ze 
for k = 2,3,..., where X= e~° and c is a constant. 


(iii) If wn) > co then |Aut(Pn,n)| = 1 as. 


7. Perfect Matchings 


The last three papers Erdés and Rényi wrote on the subject of random graphs 
were devoted to the existence of 1-factors. In [ER64] and [ER68] they coped 
with the relatively easier case of random bipartite graphs. In both papers 
they consequently emphasized the matrix terminology. “In the present paper 
we deal with certain random 0-1 matrices. Let M(n,N) denote the set of 
all n by n square matrices among the elements of which there are exactly N 
elements (n < N <n?) equal to 1, all the other elements are equal to 0. The 
set M(n,N) contains clearly (=) such matrices; we consider a matrix M 
chosen at random from the set M(n,N), so that each element of M(n, N) 


has the same probability (=) . to be chosen. We ask how large N has to 
be, for a given large value of n, in order that the permanent of the random 
matriz M should be different from zero with probability > a, whereOQ<a< 1. 
(...) A second way to formulate the problem is as follows: we shall say that 
two elements of a matriz are in independent position if they are not in the 
same row and not in the same column. Now our question is to determine 
the probability that the random matriz M should contain n elements which 
are all equal to 1 and pairwise in independent position.” 

The result they prove resembles that for the connectedness (compare 
Theorem 2.1). 


Theorem 7.1 ([ER66]). Let P(n,N) denote the probability of the event 
that the permanent of the random matrix M is positive. Then if 


N(n) = nlogn + cn + o(n) 
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where c is any real constant, we have 


lim P(n,N(n)) = e77¢ © 


n—->co 


Finally, they also mention graphs: “This result can be interpreted also in 
the following way, in terms of graph theory. Let n,n be a bichromatic random 
graph containing n red and n blue vertices, and N edges which are chosen at 
random among the n? possible edges connecting two vertices having different 
color (so that each of the (=) possible choices has the same probability). 
Then P(n, N) is equal to the probability that the random graph Tn,n should 
contain a factor of degree 1, t.e., Vn,n should have a subgraph which contains 
all vertices of [,,n and n disjoint edges, i.e., n edges which have no common 
endpoint.” (They seem not to use the name ‘perfect matching’ at all.) 

As far as the proof is concerned, “Besides elementary combinatorial and 
probabilistic arguments similar to that used by us in our previous work on 
random graphs (...) our main tool in proving our results is the well-known 
theorem of D. Konig, which is nowadays well known in the theory of linear 
programming, according to which if M is an n by n matrix, every element 
of which is either 0 or 1, then the minimal number of lines (i.e., rows or 
columns) which contain all the 1-s, is equal to the maximal number of 1-s in 
independent position. As a matter of fact, for our purposes we need only the 
special case of this theorem, proved already by Frobenius (1917), concerning 
the case when the maximal number of ones in independent positions is equal 
ton (...). According to the theorem of Frobenius-Kénig 1— P(n, N) is equal 
to the probability that there exists a number k such that there can be found k 
rows andn—k—1 columns of M which contain all the ones (0< k <n—1).” 
The rest of the proof is devoted to showing that this is very unlikely for 
N(n) given. It is interesting to notice that Erdés and Rényi never mention 
Hall’s theorem, which is equivalent to Frobenius but far more popular in 
combinatorics nowadays. 

The 1968 paper is a straightforward extension of the 1964 result, where 
it is shown that setting 


N(n) = nlogn + (r — 1)nloglogn + nw(n) 


where w(n) tends arbitrarily slowly to infinity then almost surely the 
bichromatic random graph contains r disjoint 1-factors. The only new element 
of the proof is the observation that if there are no r disjoint 1-factors then 
there is a way to delete some edges so that no vertex looses more than r — 1 
from its degree and the resulting subgraph contains no 1-factor at all. Then 
again the theorem of Frobenius is used. 

The most involved of the three papers about 1-factors is that from 
1966, where an ordinary (not bichromatic) random graph [7 is considered. 
The reason is that the theorem of Tutte describing the structure of graphs 
which admit 1-factors is more complex than its counterpart in the bipartite 
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case. “It should be added that the problem investigated in the present paper 
is much more difficult than the corresponding problem for even graphs solved 
in [5]. Thus for instance in [5] we made use of the well known theorem of 
D. Konig; the corresponding tool in the present paper is the much deeper 
theorem of Tutte mentioned above.” ([5] = [ER 64]) 

The result of that paper says that the threshold for containing a 
1-factor coincides with that for disappearance of isolated vertices, and thus 
also with that for connectivity (see Theorem 2.1). The proof is long and 
tedious and involves a weaker version of Tutte’s theorem ignoring the parity 
of components. 

Erdés and Rényi make also the following claim: “If N = snlog n+O(n), 
as mentioned above, with probability near to 1, Pn,n consists of a connected 
component and a certain number of isolated points. With the same method 
(...) one can prove that if the connected component of [n,n consists of an 
even number of points, it has with probability near 1 a factor of degree one. 
As the proof of this result is almost the same (...) we do not go into the 
details.” 

The above mentioned result was proved (in a strengthened form) by 
Bollobés and Thomason [18]. In order to quote that result let us extend 
the notion of a perfect matching by saying that a graph satisfies property 
PM if there is a matching covering all but at most one of the nonisolated 
vertices. It is known that, switching to the binomial model, as soon as 
2np — logn — loglogn — ov, there are only isolated vertices outside the 
giant component. However, the main obstacle for the property PM is the 
presence of a pair (at least two such pairs when the number of nonisolates is 
odd) of vertices of degree 1 adjacent to the same vertex (called, as we already 
mentioned, ‘a cherry’). The expected number of cherries is 


3 (3 )ra — p)2n-3) < n3p%e-2P+6P = (1) 


if 2np —logn — 2 log logn —> oo. Again, a trivial necessary condition becomes 
almost surely sufficient. 


Theorem 7.2 ([18]). Let y, = 2np — logn — 2loglogn > oo. Then 


0 if Yn > —00 
P(D np € PM) — e-se if Yn 7 C 
1 if Yn > OO. 


The proof, again, was based on Tutte’s theorem. Years later Luczak and 
Rucitiski proposed an alternative approach, via Hall’s Theorem, invented 
in [65] to attack a more general question. For a given graph G, a perfect 
G-matching of a graph is a spanning subgraph which is a disjoint union of 
copies of G. For G = Kg this is the ordinary notion of a 1-factor. 
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In [65] it was shown that for every nontrivial tree T, the threshold is the 
same as that for disappearance of isolated vertices. 


Theorem 7.3 ([65]). For every tree T on t vertices and with at least one 
edge, assuming n is divisible by t, 


0 if np — logn + —oo 
P(En,p has a perfect T-matching) + 4 e-* °  ifnp—logn>c 


1 if np — logn + oo. 


The threshold for arbitrary G is not known in general. Some partial results 
are contained in [4] and [72]. 

Coming back to the original papers of Erdds and Rényi, the last of them 
is concluded by the following problem: “does a random graph T,,n where n 
is even and 


1 -1 
N= grlcgn + ——nloglogn + w(n)n 


where w(n) — oo, contain at least r disjoint factors of degree one with 
probability tending to 1 for n + co?” 

Shamir and Upfal [77] answered this question in the positive. Given a 
map f of V(G) into the set of non-negative integers, define an f-factor of G 
as a spanning subgraph of G in which the degree of vertex x is f(z). 


Theorem 7.4 ({77]). Jf 
1 
p= (login +(r—1)loglogn+w(n), 


PS ly lim, 9wln) = oo and 1 < f(a) <r, oy Je) even. then Ty» has 
an f-factor, almost surely. 


Although f-factors are characterized by Tutte’s theorem, Shamir and 
Upfal chose an alternative approach using an algorithmic technique (in- 
troduced to random graphs by Pésa) of augmentation of sub-factors by 
alternating paths. In fact, the answer to the last question of Erdés and 
Rényi does not follow directly from the above result (not every r-factor 
has a 1-factorization) but from the proof. In 1985 Bollobds and Frieze [17] 
strengthened this answer by proving that almost surely in the random graph 
process of adding edges one by one, as soon as the minimum degree becomes 
r, there are |r/2| disjoint hamiltonian cycles plus a disjoint perfect matching 
if r is odd. 

The next problem we would like to mention cannot be directly attributed 
to Erdés and Rényi. Here is how Erdés describes their omission [3, Ap- 
pendix B]. “When Rényi and I developed our theory of random graphs, we 
thought of extending our study for hypergraphs. We mistakenly thought that 
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all (or most) of the extensions would be routine and we completely overlooked 
the following beautiful question of Shamir. (...) Shamir asked how many 
triples must one choose on 3n elements so that with probability bounded away 
from zero one should get n vertex disjoint triples. Shamir proved that n3/? 
triples suffice, but the truth may very well be n'*© or even cnlogn. The reason 
for the difficulty is that Tutte’s theorem seem to have no analogy for triple 
systems or more generally for hypergraphs.” The result mentioned by Erdés 
belongs, in fact, to J. Schmidt-Pruzan and E. Shamir [75]. In 1995, Frieze 
and Janson in [35] pushed the bound down to n4/3, 

Fortunately, Erdés and Rényi did not overlook some other important 
problems which stimulated the research in the theory of random graphs 
over the years. One such problem was the threshold for existence of a 
Hamiltonian cycle in a random graph. They, in fact, asked only: for what order 
of magnitude of N(n) has T,.n(n) with probability tending to 1 a Hamilton- 
line (i.e., a path which passes through all vertices). This problem was first 
tried by Posa [69] and Korshunov [55] and finally solved by Kémlos and 
Szemerédi [54] and, in a stronger form, by Bollobas [12]. They proved that 
the threshold for Hamiltonian cycle coincides with that of disappearance of 
all vertices of degree 0 and 1. 


8. Update for the Second Edition 


We wrote this paper back in 1995. In this second edition of the volume we 
decided to leave the original text intact except for a few obvious corrections 
and the proofs of Theorems 3.2 and 3.3 which have been deleted entirely. 
However, several new developments have occurred afterward. Here we would 
like to mention some of them along with a couple of earlier results omitted 
in the first edition. Needles to say, our choice is quite subjective. For more 
thorough treatment of random graphs we refer the reader to the monograph 
[43] published in 2000. 

In relation to connectivity, one should note that an old result of 
Luczak [60] states that the k-core of a random graph, for p large enough, 
is a.s. empty or k-connected. It implies that T,,,, is a.s. c([',,»)-connected for 
the ranges of N larger than those in Theorem 2.2. 

In the domain of small subgraphs of random graphs there has been an 
intense study of the so called upper tail of the random variable Xg counting 
copies of a given graph G' in, ». As far as the lower tail is concerned, whose 
special case is the probability P(X = 0) discussed briefly after Theorem 3.2, 
the asymptotic order of magnitude of the logarithm of P(X < (1 — «)EX) 
has been determined in [39] to be —®g. The exponent in the upper tail, 
P(X > (14+ 6)EX), is of a smaller order of magnitude which is still to 
be determined. In [44] general lower and upper bounds were obtained which 
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differ only by a logarithmic factor. Very recently DeMarco and Kahn [21] have 
found the right threshold for cliques and formulated the “right” conjecture 
for the general case. 

In Sect.5, the threshold for topological cliques found in [2] has been 
sharpened (see [62], a remark after Corollary 18). A significant result about 
the chromatic number of a random graph appeared in [1]. Achlioptas and 
Naor found therein an explicit two-point limiting distribution of y(T‘n,p), 
where p = d/n, for every d > 0, strengthening a theorem from [61] mentioned 
at the end of Sect. 5. 

The most acclaimed result in random graph theory which appeared after 
1995 is, without doubt, a solution to the celebrated Shamir problem posed 
in Sect. 7. After some initial attempts (Krivelevich [52, 53] and Kim [50]), in 
2008 Johansson, Kahn, and Vu [46] published a complete solution to both, the 
hypergraph Shamir problem and to its random graph counterpart (triangle- 
factors), receiving for their achievement the prestigious Fulkerson Prize. Quite 
recently in a series of papers, Dudek, Frieze, Loh, and Speiss [22-24, 34] 
obtained thresholds for the hamiltonicity of random uniform hypergraphs. In 
the hardest case of so called loose Hamilton cycles they incorporated in their 
proofs the result on perfect matchings from [46]. 
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Summary. We prove an unexpected upper bound on a communication game 
proposed by Jeff Edmonds and Russell Impagliazzo [2, 3] as an approach for proving 
lower bounds for time-space tradeoffs for branching programs. Our result is based 
on a generalization of a construction of Erdés, Frankl and Rédl [5] of a large 3- 
hypergraph with no 3 distinct edges whose union has at most 6 vertices. 


1. Introduction 


Suppose that we have two vectors u and v of length k. We want to decide 
whether u = v, but our access to the bits is very limited—at any moment 
we can see at most one bit of each pair of the bits u; and v;. You can imagine 
the corresponding bits to be written on two sides of a card, so that we can 
see all the cards but only one side of each card. After every flip we can write 
down some information, but the memory is not reusable—after the next flip 
we have to use new memory. We are charged for every bit of memory that 
we use and for every time we flip one or more cards. 

It seems natural to suppose that if we flip the cards only a few times, we 
need a lot of memory. We prove an unexpected upper bound on the amount 
of memory needed; our bound is asymptotically tight if the number of card 
flips is constant. Our result is based on a construction of Erdés, Frankl and 
Rédl [5], whose special case is a large (in the number of edges) 3-hypergraph 
(a system of 3-element sets) with no 3 distinct edges whose union has at 
most 6 vertices, and on a previous result of Rusza and Szemerédi [8]. This 
special case of their construction corresponds to the case when only two flips 
of the cards are allowed. Our main idea is a geometric interpretation of the 
construction of a large set with no three element arithmetic progression due 
to Behrend [1] and its generalization to higher dimensions. We discuss this 
connection in Sect. 4. 

In fact, Jeff Edmonds and Russell Impagliazzo proposed this game as a 
tool for proving lower bounds in complexity of boolean functions and proved 
that a reasonable lower bound on the sum of the number of flips and the 
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number of bits of memory used during the game would imply a new lower 
bound for branching programs [2, 3]. We give a simple protocol with O(log n) 
probes which uses only O((log n)?) bits of memory. We discuss this connection 
and protocol in Sect. 5. 

First we describe the game more precisely and state some easy facts in 
Sect. 2, and prove our upper bound in the communication game setting in 
Sect. 3. 


2. The Game 


Formally we describe the game as follows. Each input «x is divided into 
some pieces (substrings) 21,...,2, in a fixed way (ie., it is given which 
coordinate belongs to which piece). Each piece corresponds to a single card. 
Let I,,..., U7 be a sequence of subsets of [1,7] and let the inputs u and v be 
given. Then II;(u,v) denotes the input consisting of the pieces x; defined as 
x, =u; ifi € Ti, and x; = u; if i ¢ I. These input vectors are called probes, 
and each of them corresponds to a choice of a visible side for each card. The 
protocol is described by the T probes Ij,...,II7 and a function Fu) on 
the input vectors. If F(u) = F(Ih(u,v)) =--- = F(Ir(u,v)), the protocol 
answers “u = v”, if not, the answer is “w 4 v”. Let B be the number of bits of 
memory that we need, i.e., the maximal length of F'(u) over all inputs uv. We 
are interested in the dependency of B on T and k. In the context of timespace 
tradeoff, the most important quantity is the minimal possible B + T, which 
corresponds to the total communication. 

This corresponds to a protocol in which we first write down some 
information about u, and then after each of T flips we just check whether 
the current probe is consistent with that information. It is obvious that if we 
discover inconsistency, the vectors are different. Thus a protocol is correct if 
no two distinct vectors pass the test. In fact, here the use of memory is even 
more restricted than in the version described in the introduction—effectively 
the first time the protocol writes down arbitrary information F'(u) but then 
after each flip we write down only a single bit indicating consistency of the 
current probe. It is easy to show that this restriction increases the amount 
of memory by at most the maximum of B and T, see [2]. 

We can describe the set of probes in another equivalent way, more 
convenient for our proof. For each i < r let V; C [1,7] denote the set of 
indices of the probes such that t € V; if and only if the ith piece of I,(u, v) 
is v; (as opposed to u; in other probes). We can assume that all sets V; are 
distinct, because the pieces which appear in identical sets of probes can be 
joined (we can use one card instead of two cards that are always flipped 
together). Also, for every i, V; is nonempty, as at least one probe has to look 
at v; in a correct protocol. This means that r < QF _ |. 

In case of T = 1, the game is just the usual communication game with 
one player having access to u and the other player to v; it is easy to see 
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that k bits of memory are needed to test equality in that case. Similarly, it 
is easy to see that B must be at least the length of any piece u,. If we fix 
all other pieces to be all zeros, then every probe is either u; or v;, and a 
correct protocol needs enough memory to distinguish any two distinct inputs 
u;. This shows that B > k/r > k/(27 —1). 

We describe the best previously known protocol; it uses B = [k/T] bits 
of memory [2]. Set r = T and divide the input into r pieces of the same 
length (we assume without loss of generality that & is divisible by r). Set 
Il, = {t}, or equivalently V; = {7} (i.e., each probe looks at just one piece 
of v and the rest comes from u). Set F(u) = ui © uz ®++: @ uy to be the 
bitwise parity of the pieces. If the protocol answers “w= v”, we know that 
F(u) = FU ((u,v)) = vi @ uz ®--: @ up and hence uy = v1. The same 
argument is valid for other pieces, hence u = v and the protocol is correct. 

It is easy to prove that no protocol in which the function F' is linear (over 
GF(2)) can be better. The equation F(u) = F(Ih(u, v)) =--- = F(Ir(u, v)) 
translates into a system of BT linear equations with 2k unknowns. If the 
protocol is correct, then the points in the k-dimensional subspace defined by 
u = v are the only solutions of the system of equations, and hence there have 
to be at least k equations. This gives B > k/T. 

Jeff Edmonds conjectured that this is in fact optimal even for non-linear 
protocols, i.e., B = 0(k/T) for every protocol [2]. We disprove this conjecture. 
In fact, we prove that for constant T the easy lower bound is much closer to 
the truth as our protocol needs only k/(27 —1)+O(Vk) bits of memory. If the 
number of probes is not bounded it is possible to achieve B+T = O((log k)?) 
using a very simple protocol. We discuss this protocol and its consequences 
in Sect. 5. 


3. The Upper Bound 


Theorem 1. For each parameter d and for each T there exists a protocol 
with T probes such that the number of bits of memory B is at most 


ip k 
1. =) |— T+ 2d+1-+logk)2?7-! 
(1+ 5) [pea] + + 2d +1-+ logk) 


Corollary 1. For T constant and d = Vk the bound is k/(27 —1) + O(Vk). 


Proof of Theorem 1. First we present the protocol. 

Each input vector is divided into r = 27 —1 pieces u1,...,u, of the same 
length | = [k/r]. We define the probes by taking the sets V;, i = 1,...,r 
to be all nonempty subsets of [1,7]. (In other words, the probes intersect as 
much as possible and the pieces are all of the same size—we have seen that 
this is necessary in a good protocol.) 

We represent each wu; as a real vector of dimension [!/d], where d is the 
parameter from the statement of the theorem, as follows. We partition uj, 
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a string of I bits, into [//d] substrings of d bits. Each coordinate is one of 
these substrings (in a given order) interpreted as an integer from [0,2¢ — 1]. 
From now on we abuse the notation and by u; and v; we mean the vectors 
described above, interpreted as points in the Euclidean space R!!/41, 

Now we are ready to describe the function F(u). Let ||x|| denote the 
Euclidean norm of a vector. Let ug denote the center of gravity of u,,..., Ur, 
ie., Uo = (i_, ui)/r. The function F(u) consists of the concatenation of uo 
and all the distances ||u; — u;||,O<i<j<r. 

As always, we first examine u and write down F(u) and then for each 
probe we check if the value of F' is equal to Fu). This finishes the description 
of the protocol. 

Let us compute the number of bits of F'(u). Instead of communicating uo, 
we can communicate the vector rug, as r is a scalar constant. Its coordinates 
are integers from [0,r(2¢ — 1)], hence d+ logr bits are sufficient for each 
coordinate, a total of (d+ logr)[I/d] < (1+ T/d)[k/r] + T +d bits for all 
coordinates. Instead of each distance we communicate its square multiplied 
by r?. This is a non-negative integer bounded by r?(2?4—1)[1/d] < r?2?41 < 
2r274k, hence it can be represented by at most T+ 2d+1+logk bits, a total 
of (T + 2d+1 + logk) Ce) for all distances. This gives the bound in the 
theorem. 

To prove the correctness of the protocol, we need to prove that if two 
inputs u and v have the same value of F for u and all probes I,,...,I7, 
then u = v. The intuitive idea is that for a given piece v; we have sufficient 
information about its distances from u,;, 7 # 7, to conclude that vj = ui. 

We need a simple geometric lemma, which we prove later. Recall that a 
point « € R” is affinely dependent on points 71,...,2, € R” if it can be 
written as their linear combination 4 a,x; such that a a; =1. 


Lemma 1. Let 7,2 1,...,2, be points in Euclidean space R” such that x is 
affinely dependent on 1,...,2,. Let y © R” be a point satisfying ||a —x;|| = 
lly — y;|| for alli =1,...,r. Thenz=y. 


Now we finish the proof of the theorem using this lmma. We can assume 
that the pieces of input are indexed in such a way that V; > V; implies 7 < j, 
i.e., in the reverse topological order with respect to inclusion of the sets V;. 
(This means for example that the piece v; appears in all probes.) 

We prove by induction on 7 = 1,...,r that u; = v;. Suppose that we are 
proving the induction step for 7, i.e., we have to prove that u; = v;. We want 
to use the lemma with x = ui, {71,...,¢-} = {uo,...,ur}—{ui}, and y = vj. 
From the construction we know that x is affinely dependent on the remaining 
points. We need to prove that ||u;—u,|| = ||v;—u,|| for all 7 =0,...,r, 7 Fi. 
We distinguish three cases. 

First, let 7 > 7. Take any t € V; — V;. By the assumption about the 
indexing of the pieces we know that such t exists. This means that (II;); = v; 
and (II); = u,; (we use (II;); as a shorthand for (II(u,v));, ie., the point 
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that represents the ith piece of the vector II;(u, v)). Now using the fact that 
F(u) = F(1y(u,v)) we get |lus — ujl| = ||): — (Hl)jll = [les — wll 

The second case is 0 < 7 < 7. Now take any t € V;. By induction 
assumption we already know that u; = v;, hence (II;); = u,;. As in the 
previous case, we get |jus — tyl| = ||(Ul:)i — (He),l| = luv — vy = [lbs — wy. 

In the last case, 7 = 0, take again any t € V;. We know that uo = (Iz)o 
because ug is a part of F'(u). The rest is again the same. 

We have established the assumptions of the lemma, and its application 
finishes the induction step. We can conclude that u = v, hence the theorem 
holds. 


Proof of Lemma 1. First we prove that the vector y — x is orthogonal to 
x; — x; for every i and 7. Using the assumption of the lemma we have 


1 
0 = 5 (lly — @all? — |x — ell? + || — yl? — lly — 2yll*) 


*( 


=-Yy U+X 4; zi a;+y' 2; =(r&—y) Li — 25). 


This means that x is the projection of y on the affine subspace generated 
by {x1,...,2,}. Using Pythagoras theorem, the projection of a point outside 
the subspace is always closer to any point in that subspace, hence y has to 
be identical with z. 


Let us point out that our protocol works for any r, as long as every two 
pieces are distinguished by some probe. The choice of r = 27 — 1 is done to 
optimize the bound. 

We could have saved some communication in the protocol. Instead of 
taking the center of u,,...,u, it is possible to communicate the shift from u,. 
to the center of u,,...,U,— , and then to use this center instead of uo for the 
distances. If we do this, it is not necessary to communicate the distances from 
ur at all. It is also not necessary to communicate the distances to the center, 
as they can be computed from the other distances. But all these savings still 
leave us with O(r?) distances to communicate. 


4. The Connection with Extremal Problems 


In this section we describe how the communication game can be translated 
into an extremal problem about hypergraphs and how that problem is related 
to extremal problems about arithmetic progressions. 

Suppose that the input is divided into pieces u4,...,u, as before, and 
let U,,...,U, be the sets of all possible values for each piece. We assume for 
simplicity that all pieces have the same size, and denote m = |U4,| =--- = 
U,.| = 2/7 Let G be the complete r-partite r-hypergraph on these sets of 
vertices. This means that each edge is a set of r points, exactly one from each 
U;. Each edge of G naturally corresponds to some input vector (u1,..., Ur). 
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The function F(u) from the protocol corresponds to a coloring of the 
hypergraph by 2? colors. The condition that the protocol is correct means 
that for no u = (w1,...,Ur), U = (U1,---, Ur), all the edges corresponding to 
u, Ih(u,v),..., I r(u,v) have the same color. In other words, some specific 
patterns (or subhypergraphs) are not allowed to be monochromatic. (We get 
more patterns, because some pieces of wu and v may be equal and we get 
degenerate versions of the original pattern.) 

Note that we always need at least m colors, as the edges (u1,..., ur—1, ur) 
and (u,...,ur—1, wp) must always have different color, since this is a 
degenerate version of the prohibited pattern. (This is really just a translation 
of the trivial lower bound into the new language, because this degenerate 
pattern just corresponds to the case in which we change just one component 
of u.) This also means that every hypergraph without a prohibited pattern 
has at most m”~! edges (out of m” possible). 

To prove a lower bound for the communication game it would be sufficient 
to prove that any hypergraph with too many edges necessarily contains a 
prohibited pattern. For the upper bound we not only need to find a large set 
with no prohibited pattern, but to decompose the complete hypergraph into 
a small number of such sets. 

This kind of problems—to find a maximal size of a structure without 
a given pattern—is well-studied in extremal combinatorics, so it is not 
surprising that at least the simplest cases of our problem have been studied. 
Most of the information about it that we present now is from the survey 
Graham and Rédl [6] and the paper by Erdés, Frankl and Rédl [5]. These 
papers also contain simple proofs for some of the results that we mention 
below. 

Let us look at the case T = 2 and r = 3. Now the prohibited pattern are 
the 3 edges (uy, U2, U3), (W1,V2,V3) and (v1,v2,u3). The degenerate version 
of this pattern are any two edges that differ in a single point. We are now 
interested in the maximal number of edges of a hypergraph that does not 
contain any of these patterns. 

For this case Rusza and Szemerédi [8] proved a slightly better bound than 
the trivial one O(m?) mentioned above, namely they proved that the number 
of edges is o(m?). This is proved using Szemerédi’s regularity lemma [9], and 
unfortunately does not give a good bound for “o”. 

This problem is actually related to a problem of finding a large set of 
numbers which contains no arithmetic progression of length 3, as was noticed 
first in [8]. Suppose that we have a set A C [0, (m—1)/2] with no arithmetic 
progression of length 3. Then we construct a hypergraph without a prohibited 
pattern by taking YU, = U2 = Us = [0,(m— 1)] and putting in all edges of 
the form (u,u+a,u+ 2a) for a € A and wu arbitrary (the addition is taken 
modulo m), i.e., all arithmetic progressions with one element from each set 
and modulus from A. 

Obviously no degenerate prohibited pattern can appear, because if two 
arithmetic progressions have two points identical, the third is identical 
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as well. A little checking shows that the non-degenerate prohibited pattern 
corresponds exactly to the situation where the moduli a, a’, and a” are an 
arithmetic progression of length 3. So, if we have a large set A, we have a 
large hypergraph. How large can A be? The best known bounds on the size 
of such A are 

m 
(log m)2Q)” 


The lower bound is a classical result from Behrend [1], the upper bound is 
due to Heath-Brown [7]. Improving these bounds is considered to be a very 
hard problem. 

To have a small coloring, we need to decompose the interval [0, m] into 
a small number of such sets, but that turns out to be easy. We also need to 
color the edges that are not arithmetic progressions, but that is trivial by 
using a new set of m/|A| colors for edges of the form (u,u+a,u+2a+4c), 
for every constant c, a total of m?/|A| colors. Thus the construction based 


oo 
EO ae 


on the largest known set |A| gives us a coloring by m20(V°8™) colors, which 
corresponds to communicating k/3 + O(Wk) bits in our game. 

Our upper bound for the communication game is based on this construc- 
tion, translated into the geometric language, so that it can be generalized into 
a higher dimension. In our construction, the arithmetic progression of three 
points is replaced by our two points and their geometric center. In particular 
our protocol gives a construction of large r-hypergraphs without certain 
prohibited patterns which generalizes the well-known case of 3-hypergraphs 
with no 3 distinct edges whose union has at most 6 vertices. 


5. Connection to Time-Space Tradeoffs 


This communication game was proposed by J. Edmonds and R. Impagliazzo 
[2] as a tool for proving lower bounds in complexity of boolean functions. 
We shall briefly describe the kind of results that one could possibly obtain 
without going into details in order to motivate our combinatorial result, for 
more information about this connection see [3]. 

A branching program is an oriented acyclic graph with one source, two 
sinks and each vertex, which is not a sink, having outdegree 2; the edges are 
labelled by variables and negated variables so that for each vertex we have a 
variable and its negation at the two outgoing edges; the sinks are labelled by 
accept and reject. An input vector determines a unique path from the source 
to a sink, the label at the sink determines, if the vector is accepted or not. 
The reason for introducing this special kind of a circuit is that the logarithm of 
the minimal number of vertices of a branching program is a natural measure of 
space (also called capacity) needed for computing a boolean function. This is 
because we can think of a vertex in the branching program as a configuration 
of the memory of a computational device. Similarly, the maximal length of a 
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path from the source to a sink corresponds to time, however this measure of 
complexity is interesting only if combined with some restriction on the size 
of the branching program. 

Since proving nontrivial lower bounds to a single measure, such as space, 
which we have described above, seems to be a very hard task, it is natural to 
try to prove lower bounds for combined measures. The branching program 
model of computation is an ideal combinatorial setting for proving a lower 
bound for the combined measure time x space. Nevertheless, so far we 
have only the trivial lower bound nlogn (for an explicitly defined n-variable 
boolean function). 

Edmonds and Impagliazzo showed [3, 4] that if we could prove a lower 
bound f(&) on the total number of bits of memory plus the total number of 
flips, we could prove a lower bound of n,/f(n)/logn for time-space product 
for oblivious branching programs for the function of element distinctness. 

However, the following simple protocol discovered by Russell Impagliazzo 
and the authors shows that it is possible to test the equality using only 
O(log k) probes and communicating O(logk) bits about each probe. This 
protocol can be converted into a protocol of the form used in Sects. 2 and 3 
that uses O((log k)?) bits of memory. 

We think of u and v as 0-1 vectors in real vector space R*. We compute 
the Euclidean distance of u and v and check if it is 0. To compute the distance, 
w-u+v-v—2(u-v), we compute u-u and v-v each using one probe and log k bits 
of communication. Then we compute the product (ys ui)(dot4 v3) using 
the same probes and additional 2logk bits of communication. To compute 
the desired inner product u-v we need to subtract the sum of terms u,v; for 
i # j. This is easily done using 2logk probes—choose them so that each of 
the crossterms can be computed by one of them, and for each probe sum all 
of these terms assigned to it. 

This protocol shows that a lower bound for element distinctness cannot 
be proved using this communication game. A more general game for which 
the above protocol cannot be used and thus seems as a feasible approach to 
time-space lower bounds was proposed by Edmonds and Impagliazzo in [4]. 
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Summary. The first paper with the above title was written by Erdés and Straus. 
Here we solve one of the problems considered there by proving that every group of 
order n contains an abelian subgroup of order at least 2°V'°” for some ¢ > 0. This 
result is essentially best possible. 

We also give a quick survey of recent developments in related areas of group 
theory which were greatly stimulated by questions of Erdés. 


1. Introduction and a Survey 


Large Abelian Subgroups 


One of the most ancient questions of group theory is: does every infinite group 
contain an infinite abelian subgroup? 

As a byproduct of their work on the Burnside problem P. S. Novikov and 
S. I. Adian (see [1]) obtained a negative answer to this question. More recently 
E. Rips [62] and A. Yu Ol’shanskii (see [58]) have independently constructed 
Tarski Monsters, i.e., infinite groups all of whose proper non-trivial subgroups 
have order p where p is some large prime. 

On the other hand as shown independently by P. Hall and C. R. Kulatilaka 
[32] and M. I. Kargapolov [40] the answer is positive for locally finite groups. 
Various results of similar flavour have been established since (see e.g., [74]). 
Recently A. Mann [48] suggested that the “dual” class of groups should also 
be considered: does every infinite residually finite group contain an infinite 
abelian subgroup? 

(A group is called residually finite if the intersection of its finite index 
normal subgroups is trivial.) 

The finite analogue of the above question goes back to Jordan (see [46]). 
G. A. Miller [46] was the first to publish a complete proof of the following well 
known result: a group of order p® (p prime) contains an abelian subgroup of 
order at least pv2e, 
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It was proved by P. Erdés and E. G. Straus [26] that in general a group 
of order n contains an abelian subgroup of order roughly log n. 

Erdés suggested that this estimate should be improved (in fact he put a 
copy of their paper in my letter-box). 

Our main result here is the following. 


Theorem 1. Every group of order n contains an abelian subgroup of order 
at least 2°V"8" for some ¢ > 0. 


(Throughout this paper log denotes logarithm to the base 2.) 

The proof uses an easy consequence of the Classification Theorem of finite 
simple groups. 

In the opposite direction Ol’shanskii [57] (see also [14]) has shown the 
existence of groups of order p* without abelian subgroups of order > pv8e, 

Note that despite much effort, no explicit construction of such p-groups 
is known. Indeed Ol’shanskii’s proof is one of the first applications of the 
random method in finite group theory. Ol’shanskii considers certain random 
Higman groups, i.e., p-groups with a central elementary abelian Frattini 
subgroup (describing such a group is essentially equivalent to giving two 
GF ‘(p) vector-spaces V and W together with an alternating bilinear function 
VxVoW.) 

For some recent applications of this method see [4, 29, 49, 50]. 


The Commuting Graph 


We can associate with an arbitrary group G its commuting graph T = I'(G): 
the vertices of I are the elements of G and two vertices g,h are joined by an 
edge if and only if g and h commute as elements of G. The maximal cliques 
of I correspond to maximal abelian subgroups of G, the dominating vertices 
are exactly the elements of the center Z(G) of G etc. 

Answering a question of Erdés, B. H. Neumann [54] (see also [28]) proved 
that if [(G) contains no infinite independent set, then there is a finite bound 
on the cardinality of independent subsets of [. Moreover he proved that in 
this case |G/Z(G)|, the index of the center is bounded in terms of a = a(T), 
the maximal size of an independent set of I. 

It was proved in [60] that in fact we have |G/Z(G)| < c®% for some (rather 
large) constant c. 

Let us denote by a(G) the minimal number of abelian subgroups covering 
G (note that a(G) is the chromatic number of the complementary graph 
I(G)). It is obvious that a(I'(G)) < a(G) < |G/Z(G)|. 

Confirming a conjecture of Erdés and Straus [26] D. R. Mason [51] proved 
that for a group of order n we have a(G) < [4] +1. 

As noted in [26] I. M. Isaacs has shown that a(G) < (a!)?. Isaacs also 
observed that for extra special 2-groups we have a(G) > 2°/? (see [6] for 
details). 


How Abelian is a Finite Group? 411 


Erdés [23] suggested that the upper bound should be improved. Such an 
improvement is a consequence of the above mentioned result from [60] namely 
we have a(G) < c®. 

In the opposite direction as R. Baer observed (see [53]) if a(G) is finite 
then |G/Z(G)| is finite. M. J. Tomkinson [69] proved that in fact we have 
|G/Z(G)| < 2a(G)'"e*@ and suggested that perhaps |G/Z(G)| < (a(G)—1)? 
is true. 

In response to a question of Erddés, V. Faber, R. Laver and R. McKenzie 
[28] showed that if a(G) = « is an infinite cardinal then |G/Z(G)| is still 
bounded in terms of a(G). Using a Partition Theorem of Erdés, A. Hajnal and 
R. Rado [25] and Tomkinson [69] improved this estimate to |G/Z(G)| < 22”. 
Tomkinson suggests that perhaps even |G/Z(G)| < 2" is true (this would be 
best possible [28]). 

Given a class y of groups we can associate with an arbitrary group G 
a graph I) (G) as follows: the vertices of Ty are the elements of G and two 
vertices g, h are joined by an edge if and only if (g,h) € vy. Thus if x is the 
class of abelian groups then P',(G) is the commuting graph. 

J. C. Lennox and J. Wiegold [45] suggested that the questions of Erdés 
should also be considered for the graphs I',(G) where x is taken to be 
the class of finite groups, soluble groups, polycyclic groups etc. For finitely 
generated groups G they obtained certain analogues of B. H. Neumann’s 
result mentioned above. Further results in this direction were obtained in 
[15, 30, 39, 71, 76]. 

As Lennox and Wiegold noted [45] when considering infinite groups it 
is necessary to restrict attention to finitely generated soluble groups, for 
otherwise these analogues are no longer true. For example M. R. Vaughan-Lee 
and Wiegold [73] constructed infinite perfect groups in which every two- 
generator subgroup is nilpotent of bounded class (see [45] for more details 
concerning similar examples). 

However, such monsters are necessarily infinite. Indeed by a well known 
corollary of J. G. Thompson’s characterisation of minimal simple groups if 
every 2-generator subgroup of a finite group G is soluble then G itself is 
soluble. 

Furthermore, as Tomkinson [69] proved if a group G has an irredundant 
G: () Hj| < n!. This shows 
for example that if G is a finite group which on be covered by m 
soluble subgroups then |G/Sol(G)|, the index of the maximal soluble normal 
subgroup of G is bounded in terms of m. 

So it seems to be reasonable to investigate the properties of the graphs 
T,(G) for arbitrary finite groups. 


covering by n subgroups Hy, H2,...,H, then 
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Conjugacy Classes 


A group G is said to be a BFC group if its conjugacy classes are finite and 
of bounded size, it is called an m-BFC group if the least such upper bound 
is m. 

One of B. H. Neumann’s discoveries [52] was that in a BFC group the 
commutator subgroup G’ is finite. Wiegold [75] conjectured that for an 
m-BEFC group we have |G’| < m2¢+s™), This conjecture was confirmed 
for nilpotent groups by Vaughan-Lee [72]. In general P. M. Neumann and 
Vaughan-Lee [56] obtained the slightly weaker inequality |G’| < m2(3+5les™) 
(see [17] for some improvements). The proof of the exponential bound for the 
index of the centre [60] rests upon this result via the following observation: 
in a finite group G every conjugacy class has size at most 4a?(I'(G)). 

Further results concerning BFC groups are to be found in 
[36, Chap. VIII/9], [47, 56]. A group which has finite conjugacy classes is 
called an FC group. For the extensive theory of these groups we refer the 
reader to [63, 68]. 

Note that a group G is m-BFC exactly if 6 = tea —1 holds for the minimal 
degree 6 of the commuting graph I'(G). The maximal degree of I is clearly 
equal to |G| — 1. For soluble groups E. A. Bertram [7] started investigations 
concerning the largest order of the centralizer of a non-central element. This 
number is Aj — 1 where Az denotes the second largest degree of 

Confirming a conjecture of Bertram Isaacs [38] and J. Cossey [19] 
independently proved that for G soluble we have A, > IG|2. T. Kepka and 
M. Niemenmaa [41] proved using a classical result of R. Brauer and K. A. 
Fowler [13] that A, > |G|* for an arbitrary finite group G. While the bound 
for soluble groups is sharp, the result of Kepka and Niemenmaa can probably 
be improved. 

Let us denote by A;(G) the number of ordered ¢t-tuples of pairwise 
commuting elements of G. 

Another problem considered by Erdés and Straus [26] is that of obtaining 
lower bounds for A;(G). This was originally suggested by Linnik. 

As observed earlier by Erdés and Turan [27] we have Ao(G) = |G|k(G) 
where k(G) is the number of conjugacy classes of G (note that the number of 
edges in I'(G) is $|G|(k(G) — 1)). This latter invariant has been investigated 
repeatedly. 

Answering a question of Frobenius, E. Landau [44] proved in 1903 that 
for a given & there are only finitely many groups having k conjugacy classes. 
Brauer [12] noted that Landau’s argument implies k(G) > loglog|G| and 
proposed the problem of finding substantially better bounds. 

The first such bound was established in [61]: every group G of order n > 4 


satisfies k(G) > E Westen for some € > 0. 
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On the other hand very recently L. G. Kovacs [42] constructed groups 
of order n with roughly Cran aD conjugacy classes for certain numbers n 
(cyclic extensions of the p-groups constructed in [43]). 

Groups with a very large number of conjugacy classes were characterised 
by P. M. Neumann [55]. 

As the above discussion shows we know a lot about A2(G). Erdés and 
Straus [26] proved that A;(G)/At-1(G) — oo as |G| — oo. However it is 
not clear how fast A;(G) should grow when t > 3. For some partial results 
see [26] and the final section of this paper. 

For infinite groups G. Higman, B. H. Neumann and Hanna Neumann [34] 
have shown that every torsion-free group can be embedded in a torsion-free 
group with just two conjugacy classes. 

Furthermore Rips [62] has constructed countably infinite groups with 
exactly 3 non-conjugate subgroups. In contrast S. Shelah [66] proved that 
every group of cardinality X; has at least 8; non-conjugate subgroups. 

A few words about the rest of this paper. In Sect.2 we establish some 
general results on large subgroups of finite groups. Section 3 contains the 
proof of our main result concerning abelian subgroups. In the last section 
we offer some comments on some related problems. Our notation will mainly 
follow that of [67]. 


2. Large Subgroups 


We use the classification of finite simple groups via the following. 


Proposition 1. Let G be a nonabelian simple group and S a Sylow subgroup 
of largest order in G. If G is an alternating group then we have |S|?°8 8 lS 
|G| and in all other cases we have |S|° > |G]. 


Proof. For the alternating groups, the sporadic groups and for the Tits simple 
group our statement follows by inspection. For the groups of Lie type we have 
in fact |S|° > |G| as noted in [4]. 


It would be most interesting and useful to show the existence of “large” 
soluble subgroups of simple groups without using the Classification Theorem. 
It is curious to note that A. Chermak and A. Delgado [18] obtained an 
elementary proof of a result that goes the other way around: if G is 
a nonabelian simple group then for every abelian subgroup A we have 
IAP? < |G|. 

Following Wielandt we call a subgroup H intravariant in the group G if 
every image of H under an automorphism of G is conjugate in G to H. For 
example any Sylow subgroup of G is intravariant. 

We are going to use the following very special case of a theorem of S. A. 
Chunikhin [67, Chap.5, Theorem 3.17]. 
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Proposition 2. Let G=Go>G,...> G2 =1 be a normal series with each 
G,; normal in G. Let Gi; = Gi_1/G; denote the factors. Suppose for each i 
that H, is an intravariant subgroup of G;. Then there is a subgroup H of G 
such that all the nonabelian composition factors of H occur as composition 
factors of some of the groups H;, and |H| > |] |Hjl. 


Let us say that a group G is an alternating type group if all the nonabelian 
composition factors of G are alternating of degree at least 8 (we exclude 
alternating factors of smaller degree for technical reasons). 


Corollary 1. Let G be an arbitrary finite group. Then 
(a) G contains a soluble subgroup S such that 


IG| @ [SP ieetee ls. 
(b) G contains an alternating type subgroup H such that |G| < |H]|°. 
Proof. Let Gp > G,...& G, = 1 be a chief series of G. Then for each 


i the factor group G; = G; /G;-1 is a direct product of isomorphic simple 
groups. If G; is abelian we set G; = Sj, otherwise let $; denote a Sylow 
subgroup of largest order in Gj, 9; is a soluble intravariant subgroup of G; 
and by Proposition 1 we have |G;| < |S;|?!8!°8!5'|. Now (a) follows from 
Proposition 2. 

The proof of (b) is analogous. 


By a result of Dixon [20] a solvable subgroup of the symmetric group 
Sym(n) has order at most 24°5°. This shows that the “log log” factor in 
Corollary 1 (a) cannot be dispensed with. 

A folklore consequence [33, 48] of a well known result of P. P. Palfy [59] 
and T. R. Wolf [77] is that if G is a soluble group then |F'(G)|* > |G| holds 
for the Fitting subgroup F(G) of G (F(G) is the unique maximal nilpotent 
normal subgroup) where a = 2.24399.... Moreover H. Heineken [33] proved 
that a soluble group G has a nilpotent subgroup H such that |H|? > |G]. 

t 


By Miller’s theorem a nilpotent group of order n = [J p;* contains an 
i=1 
Hi 


t 
abelian subgroup of order at least [] pY*™ (it is not clear whether this holds 
i=1 


for say soluble groups). 
When combined the above results immediately yield that an arbitrary 


group of order n contains an abelian subgroup of order at least oe maken, 
To erase the “loglog” factor here we have to consider abelian subgroups of 
alternating type groups in more detail. 

Before that we would like to point out another application of the above 
simple ideas (and the Classification Theorem). 

As Jordan proved there is a function J(n) such that whenever G is a 
finite subgroup of GL(n,C) then G has an abelian normal subgroup A with 
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|G : A| < J(n). The best upper bound obtained for J(n) by elementary 
2 


means is roughly 2" (see [37, 64)]). 

As the symmetric group Sym(n + 1) has a faithful linear representation 
of degree n we know that J(n) > (n+ 1)!. 

Here we observe the following. 


Proposition 3. J(n) < (n!)?° for n sufficiently large. 


Proof. Let G be a linear group of degree n. By [37, Theorem 14.16] there is 
an abelian normal subgroup A of G such that |H : HM Aj < 12” holds for 
every abelian subgroup H of G. 

Let S be a solvable subgroup of maximal order in G (clearly S D A). 
By a result of L. Dornhoff (see [22]) a finite soluble subgroup of G'L(n,C) 
contains an abelian normal subgroup H of index less than c” for some c < 9. 
Now |S/A| < |S: H||H : HM Aj < 100”. Using Corollary 1 (a) we obtain 
that |G: A] < (100")?!8!08(100") < (n!)?° for n sufficiently large. 


Close to sharp estimates for J(n) appear in unpublished work of B. 
Weisfeiler (personal communication from G. R. Robinson). 


3. The Proof of the Theorem 


Let G be a group of alternating type of order n. Clearly all alternating factors 
of G have degree less than ¢ = [logn]. We define the numbers ¢; by @; = 


[log n] (where log denotes the i-th iterated logarithm of n) for i = 1,...t 


where ¢ is the largest natural number such that log~n > 6 and set (44; = 6 
and f:49 = 4. Note that 2°+1 > é; holds fori =1,...,t+1. 
We will need the following corollary of results due to Bercov [5]. 


Lemma 1. Suppose that a minimal normal subgroup of a finite group G is 
a direct product of (isomorphic) alternating groups of degree at least 7. Then 
N has a complement C in G (i.e., we have NC =G and NNC =1). 


Lemma 2. Let G be a group of alternating type. There exists a series of 
subgroups Go, G1,...,G, of G such that 


(a) Go is soluble and its order is the product of the orders of all abelian 
composition factors of G. 

(b) Fori =1,...,t the nonabelian composition factors of G; are alternating 
groups. The number of all such factors of degree r is the same as in G 
for €;>r > €j41 and otherwise it is zero. 

(c) |Sol(G;)| divides |Go|. 

(d) Sol(G;) = F(G;) for every i. 


Proof. The existence of a series of subgroups satisfying (a), (b) and (c), 
follows from Lemma 1 by an obvious induction argument. 
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Suppose now that Go, G1,...,G, is a series of subgroups of G satisfying 
(a), (b) and (c) such that the orders of the subgroups Gj,..., G; are minimal. 
We claim that this series satisfies (d) as well. 

By a well known observation [35, p. 269] if G is a group, N a normal 
subgroup of G such that no proper subgroup of G has G/N as a factor group 
then N is contained in the Frattini subgroup ®(G) of G. Applying this to G; 
and Sol(G;) we see that Sol(G;) C ®(G;) C F(G;) by the minimality of G; 
for every 7. 


We also need the following useful result [3]. 


Lemma 3. Let G be a permutation group of degree d. If G has no alternating 
composition factors of degree > D (D > 6) then |G| < D‘7!. 


Next we prove a crucial technical lemma. 


Lemma 4. Let the group H be a direct product of the alternating groups 
Alt(d;) (¢ =1,...,k) with r < dj; < 2” for some r > 8. Let A be a subgroup 
of Aut(H) such that A has no alternating composition factors of degree > 2". 
Then 


(a) |A| < |H|?. 
(b) For any p <r (p prime) H contains an elementary abelian p-subgroup of 
order at least 


log p 


|H| arp , 


Proof. A permutes the k factors of H. The kernel of this action K is a 
k 
subgroup of [] Sym(d;) and therefore we have |K| < |H|2‘ (here we used 


i=l 
the fact that Aut(Alt(d)) = Sym(d) for d > 7). The factor group A/K is 
a subgroup of Sym(k) and by Lemma 3 its order is less than (2”)’. On the 
other hand we have |H| > (")' > (2r + 1)! and (a) follows. 

Clearly it is sufficient to prove (b) for the case when H = Alt(d) is an 
alternating group (i.e., t = 1). In this case H contains an elementary abelian 
p-subgroup of order pled, 

If d > 2p then 2p| 4] >dandr > logd imply 2rp|$| > dlog d. Therefore 


log p log p 


piel > (dt) 2» > | Alt(d)| 
2rp 
If d < 2p then we have peer = (2")?? > | Alt(d)| as required. 


Next we will list some basic properties of the generalized Fitting subgroup 
F*(G) (see [2, Chap. 11]). 

Let us recall that a group H is called quasisimple if it is a perfect central 
extension of a simple group JL, i.e, if H = H’ and L ~ H/Z(#H). It is well 
known that if Z is an alternating group of degree at least 8 then |Z(H)| 
divides 2 [67, I Chap. 3, Theorem 2.22]. 
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The components of a group G are its subnormal quasisimple subgroups. 
The subgroup E(G) is generated by the components of G and in fact it is a 
central product of the components. 

The generalised Fitting subgroup is F*(G) = E(G) - F(G) (this again 
turns out to be a central product). The most significant fact about F*(G) is 
that Co(F*(G)) = Z(F*(G)). This means that G/Z(F*(G)) acts faithfully 
on F*(G) by conjugation. 

We also need two easy observations about Sol(G): if C is a normal 
subgroup of G then Sol(C’) = Sol(G)NC and if C C Sol(G) then Sol(G/C) = 
Sol(G)/C. 

Let alt(G) denote the product of the orders of all (nonabelian) alternating 
composition factors of G. 

The following lemma contains a major part of the proof of our Theorem. 


Lemma 5 (Main). Let us fiz an index i,1 <i <t, and consider the group 

G; (as in Lemma 2) and an odd prime p < bi41. Suppose that for C = 

Ca, (Op(Gi)) we have alt(C) > x. Then G contains an abelian p subgroup of 
1 


og P 
order at least x 4?%+1. 


Proof. The centraliser C is a normal subgroup of G; therefore Sol(C) = 
Sol(G;) 1 C. By Lemma 2 (d) we have Sol(G;) = F(G;) therefore Sol(C) is 
the centraliser of O,(G;) in F(G;) = Op(Gi) x Op: (Gi), ie., it is Z(Op(Gi)) x 
Op (Gi). - - 

Consider the factor group C = C/O,/(Gi). If C has an abelian p-subgroup 
P then clearly C has an abelian p-subgroup isomorphic to P. Therefore it is 
sufficient to prove that C’ has a large abelian p-subgroup. 

Now Sol(C’) = Sol(C)/Op(Gi) = Z(Op(Gi)). It follows that Sol(C) = 
F(C) and in fact F(C) = Z(C) (for by definition Z(O,(G;)) is contained in 
the center of C). 

Consider F*(C) = E(C)- F(C). As the nonabelian composition factors 
of C are all alternating groups of degree at least 8 the center of F (C) is a 
2-group. On the other hand Z7(E (C)) is contained in the p-group Sol(C), i.e, 
Z(E(C)) = 1 and E(C) is a direct product of alternating groups. 

C acts by conjugation on F*(C) and as noted above the kernel of this 
action is Z(F*(C)). As we have F(C) = Z(C) it follows that C/Z(F*(C)) 
acts faithfully on E(C). It is also clear that \C/Z(F*(C 3))| > alt(C) > x. 

Using Lemma 4 (a) we obtain that |E(C)| > ./x and by Lemma 4 (b) 


nm log p 
E(C) contains an abelian p-subgroup of order at least «*”+1 . The statement 
of the lemma follows. 


We need two more folklore observations. 


Proposition 4. Let P be a p-group, p an odd prime. Suppose that Aut(P) 
has an elementary abelian section (factor group of a subgroup) of order 2%. 
Then |P| > p®. 


418 Lasl6 Pyber 


Proof. Let S denote the group of automorphisms of P which stabilise 
P/®(P). It is well known that S is a p-group, therefore Aut(P)/S has 
an elementary abelian section of order 2°. However, Aut(P)/S has a 
natural embedding into GL(r,p) where |P/®(P)| = p”. From the structural 
description of the Sylow 2-subgroups of GL(r,p) [16] the inequality a < r 
follows immediately. 


Proposition 5. There exists an absolute constant c such that for any n and 
1<i<t the product of the primes p, Ci41 > p > li+2, is at least 2°%+, 


Proof. This follows, e.g., from the estimates in [65] using 2+? > ¢;44. 


Proof of Theorem 1. Let G be a group of alternating type. Take a subgroup 
G; such that x; = alt(G;) > 28!loslos m1? Vlog n 

Suppose first that for some prime p, £41 > p > 6:42, we have 
alt(Cg,(Op(Gi)) > Vx; By Lemma 5 G; and therefore G contains an abelian 
subgroup of order at least 


1 
Spl; 
x,” wt SS gvlogn 


On the other hand if alt(Cg,(Op(Gi))) < Wa then for A = 
Gi /Ca,(Op(Gi))) we have alt(A) > /z; and A C Aut(O,(G;)). Consider 
the factor group A = A/Sol(A) and H = Soc(A). It is well known that 
Soc(A) is a product of nonabelian simple groups and that A has an embedding 
into Aut(H). Now it is clear that |A| > Jz; and that A and H satisfy the 
conditions of Lemma 4 with r = @;,;. Therefore A has a section which is 

1 

8li41 


an elementary abelian 2-group of order at least x . By Proposition 4 it 


follows that 
log x; 
|Op(Gi)| > pe. 


If this latter inequality holds for all primes p, 0:41 > p > i+2, then using 
Proposition 5 we obtain that 


c 


log @,; 
OG) = "tet SP, 
igi ep>ligi 


Suppose now that G contains no abelian subgroup of order greater than 
2vlee” Tt is clear that t is less than, say, 10loglogn. Therefore the product 
of the a; for which 2; < 28! 08lo871? Vlog” ig Jess than 280!log log n]* Vlogn 


Using Lemma 2 (b) and (c) we obtain that |Go| > (ate )* 


By Lemma 2 (a) this implies that for n sufficiently large |Go|?+= > 
alt(G) |Go| = |G. 
By the result of Heineken [33] mentioned in Sect.2 this implies that G 


a. 
contains a nilpotent subgroup of order at least |G|?C*©. 
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Using Miller’s theorem we obtain that G contains an abelian subgroup of 
order at least 2°V'S" for some € > 0. 

This completes the proof for alternating type groups G and by Lemma 2 
(a) it follows that the Theorem holds for arbitrary finite groups. 


4. Odds and Ends 


By a classical result of P. Hall [31] a p-group P of order p® has derived 
length less than loga. This implies that P has an abelian section of order 
at least |P| Bee, Combining this observation with the results of Sect. 2 we 
obtain that an arbitrary group of order n has an abelian section of order at 


least nGestes™)? for some c > 0. Perhaps much more is true. 


Problem 


Does there exist an ¢ > 0 such that every soluble group G has a class 2 
subgroup of order at least |G|*? 

A group G is said to have property P,, if for each m element subset 
{£1,...,@%m} of G there exists a permutation 7 € Sym(m) such that 71-...- 
Im = Lx(1)*+++* Lx(m)- Let per(G) denote the smallest m such that G has 
property P,,. 

Much is known about groups with per(G) small [9-11]. For example R. 
Brandl [11] proved that |G/F(G)| is bounded in terms of per(G). 

In the opposite direction it follows from some crude estimates (see, 
e.g., [9]) that if H is a subgroup of G then per(G) < |G : H|(|H’| + 1). 
Therefore if G has an abelian section of order x then per(G) is less than, 


say, a; For a group G of order n this gives us per(G) < ni Toate)? (this 
sharpens a result of Brandl [11]). 

It should be possible to improve this estimate to say per(G) < n? for n 
sufficiently large. In fact no examples with per(G) significantly larger than 
logn seem to be known (note however that per(Sym(m)) > m [9]). 

Let us make now a few comments concerning the behaviour of the function 
A;(G) (recall that A;(G) is the number of pairwise commuting ¢-tuples of G). 
Denote by A;(n) the minimum of A;(G) for all groups G of order n. We will 
indicate how one can give a lower bound for A;(n) when n is a prime-power. 


Proposition 6. For every integer t > 1 there exists a constant cp such that 
Ai(p*) = [cra * |p. 


Sketch of the proof. Take a group P of order p*. Define a series of elements 
91,-++;9a as follows. If we are given gi,...g;-1 then consider the subgroup 
P,_, they generate (P;-1 turns out to be a normal subgroup of P of order 
p'—+). If g; is an element of order p in the center of P/P;_; then any element 
of the corresponding coset of P; can be taken as g;. 
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It is clear from the definition that g,,...,gq are pairwise non-conjugate 
elements and g; has at most p’ conjugates. 
Now it is obvious that A;(P) = )>,- p At-1(Cp(g)). As |P : Cp(gi)| is 


the number of conjugates of g; it follows that ae > ba Ace t) and 


gEP 


that |Cp(gi)| > p**. 
By induction we obtain that A,(p*) > p® > |e:_1(a — i)'~?] and our 


statement follows easily. 


It would be most interesting to find an infinite series of p-groups P; of 
order p™, p fixed a; — co for which 4 A < ca; for some constant c. (Note 
again that A2(P;)/|P;| is roughly the same as k(P;).) 

It is not clear how fast A;(n) should grow in general for t > 3. However, 


(Ae@)) =r can at least be much smaller than n. 


we observe that 
Proposition 7. A;(Sym(m)) < cm!3*5 for some c> 0. 


Proof. As Dixon [21] observed the number of maximal abelian subgroups of 
Sym(m) is less than m! and the maximal order of an abelian subgroup is 
roughly 3. Our statement follows. 


As noted in Sect.1 if a group G contains at most n pairwise non- 
commuting elements then |G/Z(G)| < c” for some large ¢ and therefore 
G can be covered by c” abelian subgroups. Erdés [24] suggested that the 
value of c in [60] should be further improved. 

We would like to point out that such an improvement (and an essential 
simplification of the proof in [60]) could be obtained by solving a problem of 
Tomkinson [69] in the special case of abelian p-groups. Tomkinson suggests 
that if a group G has an irredundant covering by n subgroups Hj,..., Hn 
then |G : (H, MN... Hn)| < ch should hold (and that co should be small). 
Let us see how the proof of this conjecture for abelian groups G would help us. 

It is proved in [69] that if {a1,...,a,} is a set of pairwise non- 
commuting elements of a group G having maximal size then the centralisers 
Ca(a#1),---,Ce@(an) form an irredundant covering of G and their intersection 
coincides with Z(G). Suppose now that P is a class 2 p-group (i.e., P’ < 
Z(P)) and that Tomkinson’s conjecture holds for the abelian group G/Z(G). 
The subgroups Cp(z;)/Z(P) form an irredundant covering of P/Z(P) with 
intersection 1, i.e., we would have |P/Z(P)| < co. 

By some preliminary results in [60] if |P/Z(P)| < c§ holds for class 2 
p-groups P then for an arbitrary group G we have |G/Z(G)| < cRgloolesn)* 

For a better understanding of the above problems it would also be useful 
to consider their analogues for Lie rings. 
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Summary. In this paper we continue the study of the method of approximations 
in Boolean complexity. We introduce a framework which naturally generalizes 
previously known ones. The main result says that in this framework there exist 
approximation models providing in principle exponential lower bounds for almost 
all Boolean functions, and such that the number of testing functionals is only singly 
exponential in the number of variables. 


1. Introduction 


Proving superpolynomial lower bounds on the complexity of explicitly given 
Boolean functions is one of the most challenging tasks of the modem 
complexity theory. Its importance stems from the fact that such bounds could 
be easily translated into similar bounds for Turing models and, thus, would 
lead to resolving central open questions in Complexity Theory like P NP 
or NC=P. 

At the moment, however, we have succeeded in proving desired bounds 
only for rather restrictive models. A substantial part of these bounds was 
obtained via a general scheme originally proposed in [16, 17] and called 
afterwards the method of approximations: 


— On the monotone circuit size—[1, 12, 14-17]; 

— For bounded-depth circuits with modular gates—[2, 11, 18]; 

— For switching-and-rectifier networks (= nondeterministic branching 
programs)—|19]; 

— For @-branching programs (see [7] for definitions)—{6]. 


The reader willing to learn more about these and related results or about the 
general perspective of the field is referred to the survey paper [3]. 

Concrete approximation models which have appeared in the literature 
can be naturally subdivided into two large groups. 

Models from the first group use inputs of the original function as their 
error tests. Such are models from [1, 2, 11, 12, 14-18]. We will call the method 
based on models of this kind the pure approximation method. 

Other models use as error tests specially designed functionals, every 
functional being attached to a single input. These models were studied, 
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and sometimes actually used in [4-6, 8, 9, 19]. See [13] for an extended 
survey; following this source, we will call the corresponding method the fusion 
method. The same word “fusion” will be also used for functionals and models. 

The most interesting question is, of course, to which extent the approx- 
imation method might be useful in proving lower bounds for unrestricted 
circuits. To that end, it was shown in [9] that the pure approximations can 
not prove bounds greater than O(n”) or, more precisely, O(n - no), where 
Ng <n is the number of essential variables of our function. Since every fusion 
model with N functionals can always be considered as a pure approximation 
model with no := n and n := n+ [logy N] (see [9, Claim 2.5}), it follows that 
lower bounds provable by any such fusion model never exceed O(n?+n log N). 

On the other hand, in [9] for every function f a fusion model was exhibited 
which, at least in principle, provides tight, up to a polynomial, lower bounds 
on the circuit size of f. The number of fusing functionals involved in that 
model was triply exponential in n, and it was also remarked in [9] that it 
can be decreased to doubly exponential. The resulting model, however, is 
somewhat artificial. 

More natural fusion model with the number of fusing functionals being 
only doubly exponential in n has been found in [5]. Their model is universal 
for nondeterministic circuit size, hence it still can prove exponential lower 
bounds for almost all Boolean functions. Note that. in view of the above- 
mentioned limitation O(n? + nlog N), this is roughly optimal for fusion 
models. 

In this paper we study the question whether there exists a natural version 
of the method of approximations in which proving exponential lower bounds 
is possible (again, in principle) with the number of fusion functionals being 
only singly exponential in the number of variables. We indeed find such a 
framework generalizing both pure approximations and fusion models. In fact, 
our framework is obtained by cleaning the underlying idea of approximations 
from the prejudice of attaching error tests to particular input strings which 
is characteristic for previous models. 

More exactly, we show that for every integer-valued function t = t(n) 
in the range n < t(n) < - there exists an approximation model 9 (in 
our framework) with O(t? log? t) error tests such that for almost all Boolean 
functions f, p(f, Mt) > t, where p(f, 2) is the distance between f and IN 
(Theorem 2). 

The main motivation for this work comes from [10], where I put forward 
the thesis that the right theory capturing the kind of machinery existing 
in Boolean complexity at the moment is the second-order system V;). This 
system can freely talk of those approximation models in which the number 
of error tests is bounded by 20‘ (and thus error tests can be represented 
by first order objects). Hence, unlike previous models, the models considered 
in this paper are within the reach of V;! in terms of size. It should be noted, 
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however, that gaining in size we lose in the constructibility. Indeed, our proof 
heavily relies upon Erdos probabilistic argument, and in order to carry it over 
in V;! we need an explicit construction. 


2. Definition of Approximation Models 


Throughout the paper, F;, stands for the set of all Boolean functions in n 
variables. Let P, = {1,...,2n,701,-.-.,7%n} C Fy, be the set of input 
functions. 

Let F be a finite set of arbitrary nature. We define an approximation 
model IN as a subset It C F, x P(F) such that 


P, x {0} CM (1) 


supplied with two binary operations A, V which are consistent with the 
projection onto F;,. In other words, we require 


f (m1 * m2) = f(m1) * f(m2); mi,m2 € M, (2) 


where « € {A,V}, and we once and for all have fixed notation f(m) for 
denoting the projection of m € SN onto the first coordinate F,,. Similarly, we 
will denote the projection onto P(F) by F(m) so that m = (f(m), F(m)). 

Now we give a set of definitions which is routine for the method of 
approximations. Namely, let 


d4(™m1,M2) = F(my, * m2) \ (F (m1) U F(ma)); m1, m2 © M, 
A = {6.(m1, m2) | * € {A, V}; m1, m2 € Me}, 


p(f, 9D) = nin mE Mine ed (Fem) =JFm) Ua) 
i=l 


(3) 
The intuitive idea behind this is that if the real circuit computes some 
function f at a node u, then the approximating circuit must compute at u 
some m € M with f(m) = f (due to (2)). Now, all tests F € F(m) have 
already found “an error”, that is F(m) C U,, 5., where the union is extended 
over all nodes v lying below u, and 6, € A naturally corresponds to the node 
v. For the reader familiar with previous analogous statements, this should 
serve as a self-sufficient proof of the following 


Theorem 1. For every f € F, and every approximation model MN, we have 
p(f, Wt) < C(f), where C(f) is the minimal possible size of a circuit over the 
basis A, V with inputs from P,, computing f. 

We conclude this section by showing that our new framework generalizes 
both pure approximations and the fusion method. 


Example 1. Let (M,A,V) be a legitimate model [9, Sect. 2]. Here Py C 
M C F, and A,V are arbitrary binary operations on M. Recall that for 
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gh eM and x € {A,V}, the subsets 5+ (g,h), 6-(g,h) of {0,1}” are defined 
as follows: 

66 (Gh) = (G*h)\ Gh), 

5. (G,h) = (G%h)\ (G*h) 
(we identify a Boolean function with its set of ones). For f € F;,, the distance 
p(f,M) is the minimal t for which there exist f,G1,.--,Gt,hi,..., he © M 
and *1,...,*, © {A, V} such that 


Sy 
aw 
S| 
IN 
mH 
e+ 
= 
a 
= 


F\fF © U&,Geh). 
i=1 


The quantity p(f,M) provides a lower bound on the circuit size of f. 

Take now two disjoint copies BY, B™ of {0,1}", and let F = BY UB". 
Consider the product F,, x M of two {A, V}-algebras, and embed it into F,, x 
P(F) = Fy x P(BY) x P(B®) as follows: 


nw:Fy,xM —> F,, x P( Bt) x P(B®), 


Gap Fe Ate \ iad Vo) 
We let MN = im(m) and endow M with the structure of {A, V}-algebra induced 
from F,, x M. Note that (1) is implied by P, CM. 
Assume that m1,m2 € IN; m, = 7(g,9), m2 = a(h,h). Representing 
dx(™M1,Mz2) in the form 6f(m1,mz2) U6, (m1,mz2), where 6°(m1,mz2) C B®, 
we have: 


C (gh) \ (G#h) = 5G, h) 


and similarly for 6; (m1,mz2). Noting that the condition 


Bm eM 3H. HEA (Fm) = F4F¢m) SLs (4) 


from (3) can be rewritten in the form 


Af eM 3x € {A,V} Am m? Emm 


(F< Us (mM, m) &F\ FC Uo (nfm) . 
val i=l 


we immediately see that p(f,M) < p(f, MN). In other words, every legitimate 
model in the sense of [9] can be simulated in our framework. 


On Small Size Approximation Models 429 


Example 2. Let us now turn to the fusion method. In fact, we might first 
apply the construction from [9] to get a pure approximation model, and then 
the construction from Example 1. Things become much more transparent, 
however, if we combine the two steps into one. Recall some necessary 
definitions [9, 13]. 

Let f € F,, be a fixed function, U = f~1(0), and V = f7'(1). Let Qf C 
{0,1}? consist of those functionals on P(U) which satisfy the following 
two conditions: 


1. F is monotone, 
2. There exists a (uniquely determined) z(F') € V such that for alla € Pn, 


F(aj|u) = #;(2(F)). 


Here, as usual, x} = 2; and x? = (72;). 

Note that these two conditions imply F(0) =0 and F(U) = 1. 

For g,h € {0,1}" we say that the pair (g,h) covers F € Qf if F(9) = 
F(h) = 1 and F(gAh) =0. The minimal number of pairs needed to cover 
the whole Q,y is denoted by p(f) and provides a lower bound on the circutt 
size of f which is tight up to a polynomial. 

Define now the mapping 


T:F, + P(QF), 
g > {F | g(2(F)) =1& F(glu) = 0}. 


Note that 7(g) =@ when g € Pn, and x(f) =Q,. 

Let F = Qy. We take the diagonal mapping 6: F, > F, x P(QF); 
g¥ (g,7(g)), denote IN = im(P) and endow M with the induced structure 
of {A, V}-algebra. Equation (1) is implied by the remark above. 

Now, 5y(0(g),0(h)) = 0 due to the monotonicity of every F € Oy. If 
F € 5,(0(g),0(h)) then o((F)) = h(2(F)) =1 and F(g A A)|u) = 0. Since 
F ¢ x(g) and F ¢ r(h), we have F(g|y) = F(hlu) = 1. Hence the pair 
(g JU; h|U) covers F. As (4) in our case simplifies to 30,,...,0, € A(F C 


Ul-1 51), we see that p(f) < p(f,M). 


Another version of the fusion method using GF‘(2)-affine functionals 
instead of monotone functionals was proposed in [5, 6]. It can also be 
embedded into our framework if we consider approximation models over the 
basis {A, ®} rather than over {A, V}. 


3. Main Result 


In this section we prove the following: 


Theorem 2. Let t = t(n) be an integer-valued function in the range n < 
i(n) <5 2” 3x Lhen there exists an approximation model MN C Fn x P(F), where 


|F| < O(#? log? t), such that p(f,M) > t(n) for almost all junctions f € Fy. 


430 Alexander A. Razborov 


Proof. Let ¢= |20t? In? t| and S = (f). Fix a set F of cardinality 2. 
For a subset 90 of F, x P(F) and Fo C F, we let 


M(Fo) = {m € M| F(m) C Fo} 
and 
m (Fo) = |I(Fo)|- 
Let also 
wm = In(Efe’™(7°))), 


where F, C F is a random subset of cardinality t. 
We are going to define by induction on k a sequence 


P, x {0} =M, CM, C... CMe C... C Fn x P(F) (5) 


along with binary operations Ax, Vz : Vte_1 < Wte_-1 4 Mty maintaining the 
following properties: 


1. Ifk <k’ then Akt lomte_1 xe = A, and Var lotp_1 xe _4 =VrE 
2. f(my *% M2) = f(m1) * f(m2) for m1, m2 © Me_1; 
3. For every m1,m2 € Mte_1 and * € {A, V}, 


|F (ma *k M2) \ (F (m1) U F(ma))| < 1; 


4. For every m © We \ Me_1, |F(m)| > min(k, 2); 
5. wo, < 2(n+k). 


Base k = 0 is obvious. 

Inductive step. Assume that 9t,,9t,...,0te_1,0te and Az, Vz 
Me_1 x Me_1 + MWte are already defined. Then we randomly extend Ax, Vx to 
Ag+i, Vet 2 te X Me > Fn X P(F) as follows. For (m1,™m2) E (Me x Me) \ 
(Me_1 x Mr_1) we let 


M1 *e41 M2 = (f(m1) * f(mM2), F(m1) UF (ma) U {Fs (m1, m2)}), 


where F,.(m , m2) is chosen at random from F \ (F(mi)UF(mz2)) if F(m1)U 
F(m2) # F and arbitrarily otherwise. All F,(m1,m2) are assumed to be 
independent. 

After this we let 


Mess = Me U im(Ag+1) U im(Vg41)- (6) 


Properties 1—3 readily follow from definitions, and 4 follows from the inductive 
assumption. We are going to show that 5 (with k := k + 1) also takes place 
with a non-zero probability. 

We may assume that k +1 < t since otherwise property 5 follows from 
4 and the inductive assumption. For simplicity we will abbreviate wo, (Fo) 
and wo, to w;(Fo), wi respectively. 
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Let us first fix some Fy C F of cardinality t and estimate E[e¥*+170)] 
for this particular Fo. Denote the set 


{(m1, m2, *) | (m1, m2) € (Me(Fo) x Me(Fo)) \ (Me-1 x Mte_1),* € {A, V}} 
by A. Then |A] < 2w?(Fo) and 

Mer1(Fo) = Me( Fo) UV {m1 *K41 $Me | (M1, M2, *) EC U& F.(m1, m2) € Fo}. 
Hence 


we+i(Fo) <we(Fo)+  S>  &(mmi,me), (7) 


(m1,m2,*)EA 


where &,(m1,™mz2) is the indicator function of the event F..(m1, m2) € Fo. 
All &.(m1,mz) are, however, independent. Therefore (7) gives us the 
estimate 


Ele’*+! (Fo)] ze ewk (Fo) . II Blefest(m1ma)] 


(m1,m2,*)EA 


2w?(F) 
< wk (Fo) ‘ (1 +4 a . 


< ewe (Fo) + F wx (Fo). 


Averaging this inequality over Fo, we have 
Ble@*+1(Fo)] < Bets (Fo) + % we (Fo), 
Now we fix a particular choice of !te+1, with the property 
eWk+ = Blewe+(Fo)] < Blew Fo)t Fw (Fo), (8) 
We will show that this implies the desired inequality wri. < 2n+ 2k + 2 if 
k+1<t. 


Let us denote e”* (Fo) by 6; (Fo). Then the inductive assumption can be 
rewritten in the form 


E[9x(Fo)] < 2", (9) 
and (8)—in the form 
Wht < E[0q(Fo) et IY nF), (10) 


The function x - gi’ a. where a = e7 is, however, convex on [1, co). Hence, 
under the condition (9), the right-hand side of (10) achieves its maximal 
value when 6;(Fo) takes on (S — 1) times the value 1, and the remaining 
time—the value S- e("+®) _§$+1< §- 2+), This gives us the estimate 


eur < p= + e2(n+k) . et (In S+2(n+k))?_ 
~ § 
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Finally, 


t At 
Wert < In(1 + e2(" +). ¢F (in S+2(n+k))” ) < 14+2(n k)+ : (In $ 2(n k))? 


and it is easy to see that #(In S$ + 2(n+k))? < 4(In $+ 2(n+t))? < 1 due 
to our choice of parameters. 

When we have the desired sequence (5), the rest is easy. We let It = 
Ups, te. Property 1 ensures that we can glue together the partial operations 
Ar, Ve to endow St with a natural structure of {A, V}-algebra. Property 2 
gives us (2), and Property 3 lets us to conclude that Vd € A, |6| < 1. Hence, 
if p(f,m) <t for some f € F, then dm € M(f(m) = f &|F(m)| < t). Due 
to Property 4, we may replace here Nt by M,. 

However, the total number of m € IN, that |F(m)| < t does not exceed 


S> wi(Fo)- 


FoGF 
|Fol=t 


Since eW*70< 9. e@*<§- 2+ by Property 5, we have that this number 
is bounded from above by S(In $+2(n+t))<o(2?"). The theorem follows. lll 


4. Conclusion 


The most interesting open question is, of course, whether the proof of 
Theorem 2 can be made constructive. The connection with V;) mentioned 
in Introduction suggests the following specific form of this question. 

Can we find a good approximation model 29N such that, as a subset of 
F,, x P(F), it is recognizable in polynomial time, and the operations /, V are 
polynomially time computable? Here “good” means “such that p(fn, 2) > 
n”) for some choice of fn € Fy”, and “polynomial” means “polynomial 
in 2”. 


References 


1. N. Alon and R. Boppana. The monotone circuit complexity of Boolean 
functions. Combinatorica, 7(1):1-22, 1987. 

2. D. A. Barrington. A note on a theorem of Razborov. Technical report, University 
of Massachusetts, 1986. 

3. R. B. Boppana and M. Sipser. The complexity of finite functions. In Jan van 
Leeuwen, editor, Handbook of Theoretical Computer Science, vol. A (Algorithms 
and Complexity), chapter 14, pages 757-804. Elsevier Science Publishers B.V. 
and The MIT Press, 1990. 

4. M. Karchmer. On proving lower bounds for circuit size. In Proceedings of the 
&th Structure in Complexity Theory Annual Conference, pages 112-118, 1993. 


On Small Size Approximation Models 433 


15. 


16. 


17. 


18. 


19. 


. M. Karchmer and A. Wigderson. Characterizing non-deterministic circuit 
size. In Proceedings of the 25th Annual ACM Symposium on the Theory of 
Computing, pages 532-540, 1993. 

. M. Karchmer and A. Wigderson. On span programs. In Proceedings of the 8th 
Structure in Complexity Theory Annual Conference, pages 102-111, 1993. 

.C. Meinel. Modified Branching Programs and Their Computational Power, 
Lecture Notes in Computer Science, 370. Springer-Verlag, New York/Berlin, 
1989. 

. K. Nakayama and A. Maruoka. Loop circuits and their relation to Razborov’s 
approximation model. Manuscript, 1992. 

. A. Razborov. On the method of approximation. In Proceedings of the 21st ACM 
Symposium on Theory of Computing, pages 167-176, 1989. 

. A. Razborov. Bounded Arithmetic and lower bounds in Boolean complexity. To 
appear in the volume Feasible Mathematics II, 1993. 

. R. Smolensky. Algebraic methods in the theory of lower bounds for Boolean 
circuit complexity. In Proceedings of the 19th ACM Symposium on Theory of 
Computing, pages 77-82, 1987. 

_ E. Tardos. The gap between monotone and nonmonotone circuit complexity is 
exponential. Combinatorica, 8:141—-142, 1988. 

. A. Wigderson. The fusion method for lower bounds in circuit complexity. In 
Combinatorics, Paul Erdos is Eighty. 1993. 

. A. E. Andreev. Ob odnom metode polugeni ninih ocenok slonosti individualnyh 

monotonnyh funkci. DAN CCCP, 282(5):1033-1037, 1985. A.E. Andreev, On a 

method for obtaining lower bounds for the complexity of individual monotone 

functions. Soviet Math. Dokl. 31(3):530-534, 1985. 

A. E. Andreev. Ob odnom metode polugeni ffektivnyh ninih ocenok monotonno 

slonosti. Algebra i logika, 26(1):3-21, 1987: A.E. Andreev, On one method 

of obtaining effective lower bounds of monotone complexity. Algebra i logika, 
26(1):3-21, 1987. In Russian. 

A. A. Razborov. Ninie ocenki monotonno slo nosti nekotoryh bulevyh funkci. 

DAN CCCP, 281(4):798-801, 1985. A. A. Razborov, Lower bounds for the 

monotone complexity of some Boolean functions, Soviet Math. Dokl., 31:354— 

357, 1985. 

A. A. Razborov. Ninie ocenki monotonno slonosti logiqeskogo permanenta. 

Mamem 8am., 37(6):887—900, 1985. A. A. Razborov, Lower bounds of monotone 

complexity of the logical permanent function, Mathem. Notes of the Academy 

of Sci. of the USSR, 37:485-493, 1985. 

A. A. Razborov. Ninie ocenki razmera shem ograniqenno glubiny v polnom 

bazise, soderawem funkci logigeskogo sloeni. Mamem Sam., 41(4):598- 607, 

1987. A. A. Razborov, Lower bounds on the size of bounded-depth networks 

over a complete basis with logical addition, Mathem. Notes of the Academy of 

Sct. of the USSR, 41(4):333-338, 1987. 

A. A. Razborov. Ninie ocenki slonosti realizacii simmetriqeskih bulevyh 

funkci kontaktno-ventilnymi shemami. Matem, 3am., 48(6):79-91, 1990. A. A. 

Razborov, Lower bounds on the size of switching-and-rectifier networks for 

symmetric Boolean functions, Mathem. Notes of the Academy of Sci. of the 

USSR. 


The Erdos Existence Argument 


Joel Spencer 


J. Spencer (&) 
Courant Institute, New York University, New York, NY 10012, USA 
e-mail: spencer@cims.nyu.edu 


Summary. The Probabilistic Method is now a standard tool in the combinatorial 
toolbox but such was not always the case. The development of this methodology was 
for many years nearly entirely due to one man: Paul Erdés. Here we reexamine some 
of his critical early papers. We begin, as all with knowledge of the field would expect, 
with the 1947 paper Erdés P (1947) Some remarks on the theory of graphs. Bull 
Amer Math Soc 53:292-294 giving a lower bound on the Ramsey function R(k, k). 
There is then a curious gap (certainly not reflected in Erdés’s overall mathematical 
publications) and our remaining papers all were published in a single ten year span 
from 1955 to 1965. 


1. 1947: Ramsey R(k, k) 


Let us repeat the key paragraph nearly verbatim. Erdés defines R(k,1) as the 
least integer so that given any graph G of n > R(k,1) vertices then either 
G contains a complete graph of order k or the complement G’ contains a 
complete graph of order J. 


Theorem 1. Let k > 3, then 


2k —2 . 
Qk/?2 < R(k,k) < i . ear, 


Proof. 'The second inequality was proved by Szekeres thus we only consider 
the first one. Let N < 2”/?. Clearly the number of graphs of N vertices equals 
2N(N-0)/2. (We consider the vertices of the graph as distinguishable.) The 
number of different graphs containing a complete graph of order k is less than 


(*" 9N(N-1)/2 Nk 9N(N-1)/2 9N(N-1)/2 


k ] Qk(k-1)/2 » kl Qk(k-1)/2 = 2 (*) 


since by a simple calculation for N < 2*/? and k > 3 
ON® 2 pareve. 


But it follows immediately from (*) that there exists a graph such that neither 
it nor its complementary graph contains a complete subgraph of order k, 
which completes the proof of the Theorem. 
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Erdos used a counting argument above, in the more modern language we 
would speak of the random graph G ~ G(n,p) with p = 5: The probability 
that G contains a complete graph of order k is less than 


N\ 5-«(e-1)/2 — N* 5-ne—1y/2 — 1 


(calculations as in the original paper) and so the probability that G or G’ 
contains a complete graph is less than one so that with positive probability 
G doesn’t have this property and therefore there exists a G as desired. Erdés 
has related that after lecturing on his result the probabilist J. Doob remarked 
“Well, that’s very nice but it really is a counting argument.” For this result the 
proofs are nearly identical, the probabilistic proof having the minor advantage 
of avoiding the annoying 2N(V—!)/ factors. Erdés writes interchangeably in 
the two styles. As the methodology has progressed the probabilistic ideas 
have become more subtle and today it is quite rare to see a paper written in 
the counting style. We’ll take the liberty of translating Erd6s’s later results 
into the more modern style. 

The gap between 2*/? and 4* for R(k,k) remains one of the most 
vexing problems in Ramsey Theory and in the Probabilistic Method. All 
improvements since this 1947 paper have been only to smaller order terms 
so that even today lim R(k, k)!/* could be anywhere from V2 to 4, inclusive. 
Even the existence of the limit has not been shown! 


2. 1955: Sidon Conjecture 


Let S be a set of positive integers. Define f(n) = fs(n) as the number of 
representations n = «+ y where «, y are distinct elements of S. We call 
S a basis if f(n) > 0 for all sufficiently large n. Sidon, in the early 1930s, 
asked if there existed “thin” bases, in particular he asked if for all positive ¢ 
there existed a basis with f(n) = O(n*). Erdés heard of this problem at 
that time and relates that he told Sidon that he thought he could get a 
solution in “a few days”. It took somewhat longer. In 1941 Erdos and Turan 
made the stronger conjecture that there exists a basis with f(n) bounded 
from above by an absolute constant—a conjecture that remains open today. 
In 1955 Erdés [1] resolved the Sidon conjecture with the following stronger 
result. 


Theorem 2. There exists S with fg(n) = O(Inn). 


Proof. The proof is probabilistic. Define a random set by Pr[a € S] = pz, the 
events being mutually independent over integers x, setting 


Px = x(**) ue 
ax 
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where K is a large absolute constant. (For the finitely many x for which this is 
greater than one simply place « € S.) Now f(n) becomes a random variable. 
For each x < y with 7+ y =n let Iz, be the indicator random variable for 
x,y € S. Then we may express f(n) = >> Iny. From Linearity of Expectation 


Elf(n)] = > Eley] = >> pepy ~ K' Inn 


by a straightforward calculation. 
Lets write pp = p(n) = E[f(n)]. The key ingredient is now a large deviation 
result. One shows, say, that 


Pr[f(n) < 5H] <e-* 


Pri[f(nm) > 2u] < e 


where c is a positive absolute constant, not dependent on n, K or p. This 
makes intuitive sense: as f(n) is the sum of mutually independent rare 
indicator random variables it should be roughly a Poisson distribution and 
such large deviation bounds hold for the Poisson. Now pick K so large that 
kK’ is so large that cu > 2Inn. Call n a failure if either f(n) > 2 or 
f(n) < w/2. Each n has probability less than 2n~? failure probability. By 
the Borel-Cantelli Lemma (as }> n~? converges) almost surely there are only 
a finite number of failures and so almost surely this random S$ has the desired 
properties. 


While the original Erdés proof was couched in different, counting, 
language the use of large deviation bounds can be clearly seen and, on this 
count alone, this paper marks a notable advance in the Probabilistic Method. 


3. 1959: High Girth, High Chromatic Number 


Tutte was the first to show the existence of graphs with arbitrarily high 
chromatic number and no triangles, this was extended by Kelly to arbitrarily 
high chromatic number and no cycles of sizes three, four or five. A natural 
question occurred—could graphs be found with arbitrarily high chromatic 
number and arbitrarily high girth—i.e., no small cycles. To many graph 
theorists this seemed almost paradoxical. A graph with high girth would 
locally look like a tree and trees can easily be colored with two colors. What 
reason could force such a graph to have high chromatic number? As we'll see, 
there is a global reason: y(G) > n/a(G). To show x(G) is large one “only” 
has to show the nonexistence of large independent sets. 

Erdés [2] proved the existence of such graphs by probabilistic means. Fix 
1, k, a graph is wanted with y(G) > / and no cycles of size < k. Fix e < i, 
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set p = n*—! and consider G ~ G(n,p) as n + oo. There are small cycles, 
the expected number of cycles of size < k is 


(Mis _— Olinpy') = 
5p! = S>O((np)') = o(n) 


k 
1=3 1=3 


as ke < 1. So almost surely the number of edges in small cycles is o(n). Also 
fix positive n < €/2. Set |u = n'~"|. A set of u vertices will contain, on 
average,  ~ u2p/2 = Q(n%) edges where a = 1 +¢—2n > 1. Further, the 
number of such edges is given by a Binomial Distribution. Applying large 
deviation results, the probability of the wu points having fewer than half their 
expected number of edges is e “. As a > 1 this is smaller than exponential, 
so 0(2~") so that almost surely every u points has at least j/2 edges. We 
need only that 4/2 > n. 

Now Erdos introduces what is now called the Deletion Method. This 
random graph G almost surely has only o(n) edges in small cycles and every 
u vertices have at least n edges. Take a specific graph G with these properties. 
Delete all the edges in small cycles giving a graph G~. Then certainly G™ 
has no small cycles. As fewer than n edges have been deleted every u vertices 
of G~, which had more than n edges in G, still has an edge. Thus the 
independence number a(G7) < u. But 


x(G~) > 


eer 
a(G-) ~ u 
As n can be arbitrarily large one can now make y(G7) > k, completing the 
proof. 

The use of counting arguments became a typographical nightmare. Erdés 
considered all graphs with precisely m edges where m = |n!**|. He needed 
that almost all of them had the property that every u vertices (wu as above) 
had more than n. The number of graphs failing that for a given set of size u 
was then 


¥ () (O29) <m+n(®) (7) 
co) <(@)(-B) 


< (Jaom(— 28)" < ()uome-mee 


Now the number of possible choices for the wu points is 


nr 
(7) cnt cw 
U 
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and so the number of graphs without the desired property is 


(Sheree =o) 


as desired. Today, with large deviation results assumed beforehand, the proof 
can be given in one relatively leisurely page. 

Many consider this one of the most pleasing applications of the Proba- 
bilistic Method as the result seems not to call for probability in the slightest 
and earlier attempts had been entirely constructive. The further use of large 
deviations and the introduction of the Deletion Method greatly advanced the 
Probabilistic Method. And, most important, the theorem gives an important 
truth about graphs. In a rough sense the truth is a negative one: chromatic 
number cannot be determined by local considerations only. 


4. 1961: Ramsey R(3, k) 


Ramsey Theory was one of Paul Erdés’s earliest interests. The involvement 
can be dated back to the winter of 1932/33. Working on a problem of Esther 
Klein, Erdés proved his famous result that in every sequence of n? + 1 real 
numbers there is a monotone subsequence of length n+ 1. At the same time, 
and for the same problem, George Szekeres rediscovered Ramsey’s Theorem. 
Both arguments appeared in their 1935 joint paper [10]. Bounds on the 
various Ramsey functions, particularly the function R(l,k), have fascinated 
Erdés ever since. We have already spoken of his 1947 paper on R(k,k). In 
his 1961 paper Erdés [3] proves 


k? 
R(3,k) > ca 
oon) ea 


The upper bound R(3,k) = O(k?) was already apparent from the original 
Szekeres proof so the gap was relatively small. Only in 1994 was the correct 
order R(3,k) = @(£5) finally shown. 

Erdés shows that there is a graph on n vertices with no triangle and no 
independent set of size « where 2 = [An'/? Inn], and A is a large absolute 
constant. This gives R(3,x) > n from which the original statement follows 
easily. We’ll ignore A in our informal discussion. He takes a random graph 
G(n,p) with p = cn~'/?. The probability that some 2z-set is independent is 
at most 


(") (1 =p? < [ne~P(e-1)/2) 
av 


which is very small. Unfortunately this G will have lots (Q(n3/2)) of triangles. 
One needs to remove an edge from each triangle without making any of the 
x-sets independent. 


440 Joel Spencer 


The Erdés method may be thought of algorithmically. Order the edges 
€1,---,€m of G ~ G(n,p) arbitrarily. Consider them sequentially and reject 
e; if it would make a triangle with the edges previously accepted, otherwise 
accept e;. The graph G~ so created is certainly triangle free. What about 
the sets of x vertices. Call a set S of x vertices good (in G, not G~) if it 
contains an edge e which cannot be extended to a triangle with third vertex 
outside of S. Suppose S is good and let e be such an edge. Then S cannot be 
independent in G~. If e is accepted we’re clearly OK. The only way e could 
be rejected is if e is part of a triangle e,e1,e2 where the other edges have 
already been accepted. But then e1, e2 must (as S is good) lie in S and again 
S is not independent. 

Call S bad if it isn’t good. Erdds shows that almost always there are 
no bad S. Lets say something occurs with high probability if its failure 
probability is o(1/(")). It suffices to show that a given 9 = {1,...,z} is 
good with high probability. This is the core of the argument. We expose (to 
use modern terminology) G in two phases. First we examined the pairs {s, t} 
with se S,t¢S . For each t ¢ S let d(t) be the number of edges to S. Set 


oe 2) 


tS 


Each d(t) has Binomial Distribution B(x, p) and so expectation xp = O(Inn) 
so that one can get fairly easily E[Z] = @(nIn*n). Note this is the same 
order as x”. It is definitely not easy to show that for appropriate A, c (Erdés 
takes c = A~!/? and A large) that Z < 4(%) with high probability. The 
requirement “with high probability” is quite severe. But note, at least, that 
this is a pure probability statement. Lets accept it and move on. Call a pair 
{i,j} C S soiled if it lies in a triangle with third vertex outside of S. At most 
Z pairs are soiled so with high probability at least $(5) pairs are unsoiled. 
Now we expose the edges of G inside S$. If any of the unsoiled pairs are in G 
then G is good and so the failure probability is at most 


re) 


and so G is good with high probability. 

Sounds complicated. Well, it is complicated and it is simultaneously 
a powerful application of the Probabilistic Method and a technical tour 
de force. The story has a coda: the Lovadsz Local Lemma, developed in 
the mid-1970s, gave a new sieve method for showing that a set of bad 
events could simultaneously not hold. This author applied it to the random 
graph G(n,p) with p = cn—!/? with the bad events being the existence of 
the various potential triangles and the independence of the various x-sets. 
The conditions of the Local Lemma made for some calculations but it was 
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relatively straightforward to duplicate this result. Still, the ideas behind this 
proof, the subtle extension of the Deletion Method notion, are too beautiful 
to be forgotten. 


5. 1962: No Local Coloring 


With his 1957 paper previously discussed Erdés had already shown that 
chromatic number cannot be considered simply a local phenomenon. With 
this result he puts the nail in the coffin. 


Theorem 3 ([4]). For any k > 3 there is an ¢ > 0 so that the following 
holds for all sufficiently large n: There exists a graph G on n vertices which 
cannot be k-colored and yet the restriction of G to any en vertex subgraph 
can be 8-colored. 


Often probabilistic theorems are best understood as negative results, as 
counterexamples to natural conjectures. A priori, for example, one might 
conjecture that if every, say, n/(Inn) vertices could be 3-colored then G could 
be 4-colored. This theorem disproves that conjecture. 

We examine the random graph G ~ G(n,p) with p = c/n. As in the 1957 
paper 


Pr[a(G) 2 2] < (") (1 —p)@) < [(ne/x)emP@-D/2]", 


When c is large and, say, x = 10n(Inc)/c, the bracketed quantity is less than 
one so the entire quantity is o(1) and a.s. a(G) < x and so x(G) > c/(10 Inc). 
Given k Erdés may now simply select c so that, with p = c/n, x(G) > k as. 

Now for the local coloring. If some set of < en vertices cannot be 3-colored 
then there is a minimal such set S' with, say, |S| = 7 < en. In the restriction 
G|s every vertex v must have degree at least 3—otherwise one could 3-color 
S — {v} by minimality and then color v differently from its neighbors. Thus 
Gls has at least 3i/2 edges. The probability of G having such an S is 
bounded by 


= Qa" sd[H5) @)"] 


employing the useful inequality ({) < ($¢)°. Picking « = e(c) small the 
bracketed term is always less than one, the entire sum is o(1), a.s. no such S 
exists, and a.s. every en vertices may be 3-colored. 

Erdés’s monumental study with Alfred Rényi “On the Evolution of 
Random Graphs” [8] had been completed only a few years before. The 
behavior of the basic graph functions such as chromatic and clique number 
were fairly well understood throughout the evolution. The argument for local 
coloring required a “new idea” but the basic framework was already in place. 
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6. 1963/4: Coloring Hypergraphs 


Let Aj,...,Am be n-sets in an arbitrary universe 2. The family A = 
{A,,...,Am} is 2-colorable (Erdés used the term “Property B”) if there is 
a 2-coloring of the underlying points Q so that no set A; is monochromatic. 
In 1963 Erdos gave perhaps the quickest demonstration of the Probabilistic 
Method. 


Theorem 4 ([5]). [fm <2"~! then A is 2-colorable. 


Proof. Color Q randomly. Each A; has probability 2'~” of being monochro- 
matic, the probability some A; is monochromatic is then at most m2!~” < 1 
so with positive probability no A; is monochromatic. Take that coloring. 


In 1964 Erdés [6] showed this result was close to best possible. 


Theorem 5. There exists a family A with m = cn?2” which is not 
2-colorable. 


Here Erdos turns the original probability argument inside out. Before the 
sets were fixed and the coloring was random, now, essentially, the coloring is 
fixed and the sets are random. He sets 2 = {1,...,u} with u a parameter 
to be optimized later. Let A,,...,Am be random n-sets of Q. Fix a coloring 
x with a red points and b = u —a blue points. As A; is random 


()+Q) . 2002) 
i 
The second inequality, which follows from the convexity of ea indicates 


that it is the equicolorings that are the most troublesome. As the A; are 
independent 
| m 


Pr[x(A;) constant] = 


2(/2) 


n 


(n) 


Now suppose 


Pr{no A; monochromatic] < c _ 
9 u/2 


m 
<1 
(7) | 
The expected number of x with no A; monochromatic is less than one. 


Therefore there is a choice of A,,...,Am for which no such y exists, i.e., 
A is not 2-colorable. Solving, one may take 


— E in [1 “AIG | 


Estimating —In(1 —¢) ~ ¢ this is roughly cu(“)/(“/?). This leads to 
an interesting calculation problem (as do many problems involving the 
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Probabilistic Method!)—find u so as to maximize m. The answer turns out 
to be u~ n?/2 at which value m ~ (eln2)n?2"~?, 

Erdés has defined m(n) as the least m for which there is a family of n-sets 
which cannot be 2-colored. His results give 0(2”) = m(n) = O(n?2"). Beck 
has improved the lower bound to Q(n!/32") but the actual asymptotics of 
m(n) remain elusive. 


7. 1965: Unrankable Tournaments 


Let T be a tournament with players 1,...,n (each pair play one game and 
there are no ties) and o a ranking of the players, technically a permutation 
on {1,...,n}. Call game {7,7} a nonupset if 7 beats 7 and o(i) < o(j); an 
upset if i beats 7 but o(j) < a(t). The fit f(T, o) is the number of nonupsets 
minus the number of upsets. One might have thought—in preprobabilistic 
days!—that every tournament T had a ranking o with a reasonably good fit. 
With J.W. Moon, Erdés [7] easily destroyed that conjecture. 


Theorem 6. There is aT so that for allo 
f(T, 0) < 3/7 (Inn)/?. 


Thus, for example, there are tournaments so that under any ranking 
at least 49% of the games are upsets. Erdés and Moon take the random 
tournament, for each pair {i,j} one “flips a fair coin” to see who wins the 
game. For any fixed o each game is equally likely to be upset or nonupset 
and the different games are independent. Thus f(T',o) ~ Sm, where m = (”) 
and S,,, is the number of heads minus the number of tails in m flips of a fair 
coin. Large deviation theory gives 


a2 
Pr[Si, > al <e 2. 


One now uses very large deviations. Set a = n3/?(Inn)!/? so that the above 
probability is less than n~" < 1/n!. This super small probability is used 
because there are n! possible a. Now with positive probability no o has 
f(f,c) > a. Thus there is a T with no o having f(T,c) >a. 

The use of extreme large deviations has become a mainstay of the 
Probabilistic Method. But I have a more personal reason for concluding with 
this example. Let g(n) be the least integer so that every tournament T on 
n players has a ranking o with f(T,o) > g(n). Then g(n) < n3/?2(Inn)!/?. 
Erdés and Moon showed g(n) > cn, leaving open the asymptotics of g(n). 
In my doctoral dissertation I showed g(n) > cn°/? and later (but see de la 
Vega [11] for the “book proof”) that g(n) < cn°/?. Though at the time I 
was but an ¢ Paul responded with his characteristic openness and soon [9] I 
had an Erdés number of one. Things haven’t been the same since. 
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IV. Geometry 


Introduction 


Erdos’ love for geometry, and elementary or discrete geometry in particular, 
dates back to his beginnings. The Erddés—Szekeres paper has been influential 
and certainly helped to create discrete geometry as we know it today. But 
Erdés also put geometry to the service of other branches, giving definition 
to various geometrical graphs and proving bounds on their chromatic and 
independence numbers. We are happy to include papers by Moshe Rosenfeld, 
Pavel Valtr, Janos Pach, Jiti Matousek and, in particular, a paper by Miklés 
Laczkovich and Imre Ruzsa on the number of homothetic sets. While the 
paper of Peter Fishburn is closely related to Erdés’ favorite theme, the 
papers of N. G. de Bruijn (on Penrose tiling) and J. Aczél and L. Losonczi 
(on functional equations) cover broader related aspects. 

In 1995/6, when the content of these volumes was already crystallizing, 
we asked Paul Erddés to isolate a few problems, both recent and old, for each 
of the eight main parts of this book. To this part on Geometry Theory he 
contributed the following collection of problems and comments. 
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Erdos in his own words 

Let x1,...,2%, be n points in the plane, not all on a line, and join every 
two of them. Thus we get at least n distinct lines. This follows from Gallai— 
Sylvester but also from a theorem of de Bruijn and myself. 

My most striking contribution to geometry is no doubt my problem on 
the number of distinct distances. This can be found in many of my papers 
on combinatorial and geometric problems. 

Hickerson, Pach and I proved that on the unit sphere one can find n 
points for which the distance 2 can occur among n!/° pairs. Perhaps this 
is best possible. In fact, there are n!/3 points at distance 1 from every other 
point. For every 0 < a < 2 there are n points so that for every point there 
are log* n other points at distance a—again we do not know if this is best 
possible. 

Purdy and I proved (using an idea of Karteszi) that there are n points 
in the plane with no three on a line for which the unit distance occurs at 
least cn logn times. We have no nontrivial upper bound. If the points are in 
3-space the unit distance can occur n4/? times (Hickerson, Pach and myself) 
but if we also assume that no four are on a plane, we can do no better than 
cnlogn. 

Szekeres and I proved, that if ee) + 1 points are given in a plane no 
three on a line, then we can always select among them n points which are 
the vertices of a convex n-gon. Probably 2”~? + 1 is the correct value—we 
proved that 2”~? is not enough. This problem (which was due to E. Klein, 
i.e., Mrs. Szekeres) had a great influence. 
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So much for Paul Erdos in 1996. He stated it here explicitly: the distance 
problem is highlighted as the most important of his problems in geometry. 
We are very fortunate that we can include in this part the paper by Larry 
Guth which surveys the solution of this problem. His article also describes 
the rich and fertile ground of Erdés problems in geometry. 

Let us remark that some of the other advances are described in the 
other chapters, for example the development related to the Erdés—Szekeres 
Theorem is part of the chapter by Graham and NeSetril in the Ramsey theory 
chapter in the second volume. We cannot be exhaustive as the number of 
geometrical problems and results discussed by Erdés was large. One topic 
which is missing in this part is clearly incidence problems. However they 
are partially covered in the number theory part and are also part of Guth’s 
article. 

The progress in combinatorial and discrete geometry has been spectacu- 
lar. So much so that Matousek speaks about 2010 as annus mirabilis: 

J. Matousek, The Dawn of an Algebraic Era in Discrete Geometry, Euro 
CG 2011, Morschach. 
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1. Introduction 


Extension theorems are common in various areas of mathematics. In topology 
continuous extensions of continuous functions are studied. In functional 
analysis one is interested mainly in linear extensions of linear operators 
preserving continuity or some other properties like bounds or norm. In algebra 
extensions of homomorphisms and isomorphisms are investigated. The latter 
can be considered as extensions of functional equations. 

In the area of functional equations one is interested in extending 
functional equations, i.e., either extending functions satisfying a functional 
equation (or a system of equations) on a “restricted” set to functions which 
satisfy the same equation (or system of equations) on the “maximal” set or 
showing that given functions satisfying a functional equation (or a system 
of equations) on a restricted set satisfy the same equation (or system) on a 
“larger” set. 

The first extension theorem for the Cauchy functional equation is in the 
1965 joint paper [AE 65] of Erdés with Aczél who proved that, if a function 
f :]0,c0o[- R satisfies the Cauchy functional equation for all positive values 
of x and y, that is 


faty)=fa)+fy) (2, €]0, cof), 


then there exists a unique function F : R — R such that F(x) = f(x) if 
x €j0, co[ (ie. F is an extension of f) and F satisfies the Cauchy equation 
for all values of x, y: 


F(a +y)=F(a)+ Fy) (@,y eR). (1) 
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In the same paper they showed that there exist no Hamel bases of the set 
of nonnegative numbers, i.e., there does not exist any set B’ whose elements 
are nonnegative and any nonnegative number is representable as a linear 
combination of a finite number of elements of B’ with nonnegative rational 
coefficients in a unique way. 

In 1960 in the problems section of Colloquium Mathematicum Erddés 
[E 60] raised the following problem. If f(x + y) = f(x) + f(y) is satisfied 
for almost all (x,y) in the plane (in the sense of plane Lebesgue measure) 
does there exist a function F' satisfying the Cauchy equation everywhere (i.e. 
satisfying (1)) such that f(a) = F(a) almost everywhere (in the sense of the 
linear Lebesgue measure)? 

The Aczél-Erdés theorem and the problem of Erdés (solved independently 
by Jurkat [J 65] and de Bruijn [B 66]) became the starting points of two 
different directions of the extension theory of functional equations, which by 
now is quite extensive (Kuczma [K 78] quoted already in 1978 more than 100 
related papers; see also [PR 95]). 

The aim of this paper is to show how the above result and problem of 
Paul Erdés influenced the development of the extension theory of functional 
equations. 


2. Extensions of Homomorphisms 


Let G, H be (multiplicatively written) groups and S be a subsemigroup of 
S. Let further f : S — H be a homomorphism of S into H. Under what 
conditions can f be extended in a unique way to a homomorphism of G? 
With G = H = the additive group of R and S = the additive group of 
positive real numbers, we get back to the Aczél- Erdés extension problem. 
The general problem was treated by Aczél et al. [ABDKR 71] by proving 
several theorems which under different assumptions on S' gave a solution. 
The first one is closely related to the extension theorem in [AE 65]. 

Suppose that for every element x € G, different from the unit element, 
either x € S or x € S (or both). Then every homomorphism f :S + H 
can be extended uniquely to a homomorphism F': G— H of G into H. 

This can be proved by defining F’ by 


E ifzv=e 
F(z) := ¢ f(z) ifee S 
f(a) ifetes 
where e, F are the unit elements of G, H, respectively. 


A unique extension also exists when S generates the Abelian group G. 
Indeed, in this case G= S$. S~! = {xy '| a € S,y € S} and by 


Fay) = f(x)fy)" @ye 8) 


Extension of Functional Equations 449 


F is well defined on G (because zy~! = uv! (x,y, u,v € S) implies rv = uy) 


and supplies the unique extension of f. 
A more elaborate result is the following. 
Suppose that S is a subsemigroup of the group G such that 


G=S-S-1.§.S-1 ={ary"uv | 2, y,u,0 € S} (2) 


and f is a homomorphism of S into H. The map f can be extended to a 
homomorphism of G into H if and only if 


L,Y, Z,U,v,w € S, cy 'z=uv!w implies f(x) f(y) f(z)=f (uf (vy f(w). 


When the extension exists, then it is unique. 

There are further results in this vein in [ABDKR 71]. Martin [M 77] 
generalized these results to the case when G can be represented in a product 
form similar to (2) but with an arbitrary number of factors. 

Necessary and sufficient conditions for the extension were found by 
Osondu [O 78] in the case when S generates G: 

Let S be a subsemigroup of a group G which is generated by S and f 
a homomorphism of S into some group H. Then f can be extended to a 
homomorphism of G into H if and only if f satisfies the following condition. 

For every positive integer n and for every 5; € S, e¢, € {-1,1} @ =1,2, 
12.5 n) 

n n 
Il =e=> [][ fa" =E 
i=1 i=1 
holds, where e, E are the unit elements of G, H respectively. 


In this result the necessary and sufficient condition seems to be too “close” 
to the statement itself. A more detached condition may be desirable. 


3. Extensions of the Cauchy and Pexider Equation 
and Their Applications 


We introduce some terminology (see Dardéczy-Losonczi [DaL 67]). Let D be 
a nonempty subset of R?. We write 
D, := {x | dy : (z,y) € D}, 
Dy = {y | de: (x,y) € D}, 
Daty = {u+y | (ey) € D}. 


Then D,, Dy are the projections of D onto the x-axis, y-axis, respectively, 
and D,z4+, is the projection of D, parallel to the line x+y = 0, onto the z-axis. 
Let further D’ := D, U Dy U Dz+y. We say that a function f is additive on 
the set D (or satisfies the Cauchy equation on D) if f : D’ > R and 
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fle+y)=f(a)+fy) ((@,y) € D) (3) 


holds. A function F : R > R is called an (additive) extension of a function f 
additive on D (we say also that F extends the Cauchy equation from D to 
R?) if F satisfies the Cauchy equation on R? and F(x) = f(x) for « € D’. 

We saw above that there exists a unique (additive) extension from R7_ (Ry 
is the set of positive reals) to R? (take S$ = R,). As Daréczy and Losonczi 
[DaL 67] showed, there always exists a unique (additive) extension from a 
neighborhood of (0,0)toR?. They took the neighborhood to be circular but it 
turned out [A 83] that “hexagonal neighborhoods” of (0,0) 


A, = {(z,y) | 2,y,x+y €] —1r,r[} 
are more convenient. Indeed, if 
f(e+y) = f(a) + fly) for (x,y) € Hy (4) 
then define for any t€ R 


Fens (2) (u N, = ina 


This definition is unambiguous since, by (4), 


(8) =) =m 5) =mr (6) en 


Clearly F(x) = f(x) for « €] — r,r[ (choose n = 1). The function F’, so 
defined, is additive on R? since, for arbitrary u,v € R, there exists an n € N 
such that 4,2, “** €] —r,r[, thus, again by (4), 


nn? 


F(u+v) =nf (= +=) =nf ( ) tng ( ) = Flu) + F(). 


U U 
nm mr 


This F : R > R, extending (4) from H, to R*, is unique because, if also 
F':R-R were additive on R? and would satisfy F(x) = f(x) for x €]—r,r[ 
then, for arbitrary t € R choosing n € N again so that + €]—r,r[, we get 


FQ) =F (n*) anf (<) apg (=) — F(t) for allt eR 


as asserted. 
However there may not exist an additive extension even in some very 
simple cases. Let, for instance f be defined by 
2e+1 for x €}2,3] 
f(x) = 4224+3 for x €]4,5] . (5) 
2x+4 for x €]6,8[ 


This function is additive on D =]2,3[x]4,5[ but no additive extension to 
R? exists. Indeed, such an extension F would be continuous on ]2,3[, thus 
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(see e.g. [A 66]) there would exist a constant c such that F(a) = cx for 
all « € R which contradicts F(#) = f(x) for x € D’ =]2,3[U]4, 5[U6, 8]. 
Nevertheless, we see from (5), that F(x) = 2x is “nearly” an extension of 
this f: it differs from f on Dz, Dy and Dz+, just by a constant each and is 
additive everywhere. Such functions are called quasi-extensions. 

To be exact, if 


fle+y)=fa)+fy) (ay) €D) (3) 


holds and there exist constants a, 6 and a function F,, additive on R?, such 
that 


f(x) = F(a) +a for all x € Dz, 

f(y) =F(y)+86 for all y € Dy, (6) 
) 
) 


f(z)=F(z)+a+8 forall z€ Dry, 


then F is an (additive) quasi-extension of f (from D to R?). These formulas 
are reminiscent of the general solution 


f(x) = F(x) +a 
gy) = F(y) + 8 (7) 
A(z) = F(z) +a+8 


(F is an arbitrary additive function on R?, a and @ are arbitrary constants) 
of the Pexider equation 


A(x +y) = f(x)+9(y) ((x,y) € R?). 


Since in (5) or (6) (and also in (3)) the three occurences of f are anyway 
defined on possibly different intervals, this is conducive to consider (3) as a 
Pexider equation 


h(@+y)=f(x)+9y) ((a,y) € D) (8) 


with f,g,h defined on Dz, Dy, Dr+y, respectively. As it turns out, these 
have an extension (not only quasi-extension) to R? from the above set 

=]2,3[x]4,5[ and from every open connected set (region) D. 

In fact, the basic result concerning quasi-extensions of the Cauchy 
equation and extensions of the Pexider equation is that from any open 
connected set (region) D in R? there exists a unique quasi-extension of 
the Cauchy equation (3) to R? (Daréczy-Losonczi [DaL 67]) and a unique 
extension of the Pexider equation (8) to R? (Riman [R 76], Aczél [A 85], and 
Rad6-Baker [RB 87]). The former follows from the latter. Both can be proved 
by taking for D first a hexagon, like H; above, but its centre shifted from the 
origin to (a,b), showing, for Eq. (8), by substitutions that 


h(u+tvu+a+tb)—h(a+b) = f(u+a)— f(a)+g(vt+b)—g(b), (9) 
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where (u,v) is in the hexagon H, around the origin. This implies (putting 
v = 0 or u=0, respectively) that 


h(t+a+b)—h(a+b) = f(t+a) — f(a) = g(t +b) —g(b)  (t€]—7,r)). 


Defining the function ® by equating ®(t) to this common value on | — r,r|, 
(9) shows that ® is additive on H,. so, by what we proved above, has a unique 
(additive) extension F to R?. The functions given by 


F(a) = F(z) — F(a) + f(a) 
gy) = Fy) — F(®) + g(®) (10) 
h(z) = F(z) — F(a+b) +h(a+b) 


(a,y,z € IR), and only these, extend the Pexider equation from the (a, b)- 
centred hexagon to R?. The proof that the Perider equation has a unique 
extension from any open connected region D to R? can be completed by 
covering D by a sequence of hexagons such that each hexagon is contained 
in D and any two consecutive hexagons have nonempty intersection and by 
applying the above extension process to each hexagon. Of course, (10) has 
the form (7) of the general solution of Pexider’s equation on R?. 

The general solution of the Pexider equation (8) on D is obtained by 
replacing, in (10), —F(a)+ f(a) and —F'(b) + g(b) by two arbitrary constants 
and —F(a+b)+h(a+b) by their sum, and restricting f,g,h to Dz, Dy, Dr+y 
respectively. The function F’, additive on R?, which we have just determined, 
is also the unique quasi-extension of f in (3) from D to R?, in the sense (6). 
If two of the sets D, (| Dy, Dz(\Dea+y, Dy(\Diu+y are nonempty, then 
the quasi-extension is an extension of the Cauchy equation. 

We note that the concept of extension and quasi-extension can be 
defined similarly for more general (in particular some closed) domains, ranges 
(groups, vector spaces) and equations. In their paper [RB 87], quoted above, 
Rad6 and Baker had a real or complex topological vector space as domain of 
f, g, h and an Abelian group as range. We mention a few further results. 

Let G and H be Abelian divisible groups and let X be a subset of G having 
the properties 


(i) For any x € X and for all rational numbers X €]0,1[ we have Az € X; 
(ii) For any pair (x,y) € X? there exists an integer n, which may depend 
upon x and y, such that (1+ y)/nE xX. 


Suppose that f : X > H is additive on D:= {(x,y)| ve X,yEeX,x+ye 
X}. Then there exists an additive extension F : G — H of f. Moreover F 
is unique if, and only if, the subgroup generated by X is G. (Ng [N 74] and 
Dhombres-Ger [DG 78].) 

Székelyhidi [S 72] found the general representation of functions which are 
additive on an open set D C R?. He also proved [S 81] an extension theorem 
for the equation A?*' f(x) = 0 where A, is the usual difference operator 
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defined by A, f(z) := f(a+y) — f(x). Let again D C R?, na positive integer 
and for k = 0,1,...,2+1 let 
Dg = {et (n +1 —b)y | (2,9) € Dh. 


The result by Székelyhidi is the following. Let D C R? be an ae and 
connected set with (0,0) € D. Let n be a positive integer and f : Maes ok 
R be a function such that 


n+1 
7 n+1 ' 
ays) => ("FP ') CMe + (n+ 1 Ku] =0. (ey) € D), 
k=0 
Then there exists an extension F: R—R of f such that gas aed =0. 
The Levi-Civita equation 


f(a+y) = a) (11) 


is a common generalization of the Cauchy and Pexider equation and the sine 
and cosine equations. 

Recently the second author [L 85a, L 90] proved an extension theorem 
which corresponds to the Aczél-Erdés result for Eq. (11). 

Let the functions f, 9%, hx :]0,oo[> C (kK =1,...,n) satisfy the functional 
equation (11) for all positive x, y and assume that the functions 91,...,9n 
and hy,...,hn are linearly independent on |0,00[. Then there exists a unique 
set of functions F,G,, H, : R- C(k =1,...,n) such that 


F(x+y) => Ger (x,y € R) 
k=1 
and 
F(x) = f(x) 
Gi(x) = gr(x) (a €]0, cof; K=1,...,n) 
Ay (x) = hg (a) 
hold. If f, gx, he(k = 1,...,n) are continuous or measurable on ]0, oo[ then 


so are F', Gy, Hy (k =1,...,n) on R. The proof is based on the following 
observation. It is known ([A 66] pp. 201-203) that the functions F),..., Fn, 
defined on a set A with values in an arbitrary field, are linearly independent on 
Aif, and only if, there exist elements a1,...,@n € A (necessarily distinct) such 
that det(Fi(a;))?j;-, # 0. Thus by the linea independence of the functions 
Jiy+++>9n there exist z1,...,2;, €]0,co[ such that det(g:(x}))? ;— 1 # 0. 
Without loss of generality we may suppose that x = min{a’, ..., 2) }. Let 
Ly = v}y,— 2 (k =1,...,n) and substitute «+2, for c(k =1,.. ee in (11). 
We obtain a system of equations which can be written as 


Fla+y)=G(@)H(y) (x,y €]0, col) (12) 
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where, for x €]0, oo[ (x1 = 0), 
F(a) = (feta)... f(@+an))" 


gi(a@ +21)... Gn(w@ +21) 


gila + Bice Qn (h 4 Dy) 
H(x) = (hi(x),...,hn(x))* 


and T denotes transposition. Extending the Pexider equation (12) we obtain 
an extension of (11). 


4. Almost Everywhere Additive Functions 


The problem of Erd6és, mentioned in the introduction, was inspired by 
Hartman [H 61] who proved that, if f(~+y) = f(x) + f(y) is satisfied for 
(x,y) € A x A, where the complement of A has (linear) measure zero, then 
there exists a function F': R > R satisfying the Cauchy equation everywhere 
and F(a) = f(x) almost everywhere on. R Actually Hartman’s paper [H 61] 
appeared later than [E 60] as a partial solution to the Erdés problem [E 60]. 
Jurkat [J 65] and de Bruijn [B 66] (independently) solved the problem in the 
affirmative, showing that, if f(a +y) = f(x) + f(y) holds for all (x,y) € D 
where the Lebesgue measure of R? \ D is zero, then there exists a function 
F:R-R satisfying the Cauchy equation everywhere and F(x) = f(x) ae., 
that is, for all x € Dy where R\Dy, is of Lebesgue measure zero. 

Is this F' an (additive) extension of f (in the sense of Darédczy-Losonczi)? 

The answer is no. Counter example (by V. Zinde-Walsh, simplified by 
C.T. Ng and J. Aczél, see [A 80, AD 89]): 


D:={(z,y)|a,y,e+y #4 1,2}U {(1,1)}, 
i.e., D is obtained from R? by removing from it the lines = 1, x = 2, y= 1, 
y =2,a+y =1,2+y = 2 except the point (1,1). Clearly R?\D is of Lebesgue 
measure zero and D’! = R. Let 


0 if €R\{1,2}, 
f(@)=41 ife=1, 
2 ifa=2. 


This function is additive on D (we have f(1+1) = f(2) = 2 
f(1)+f(1) and elsewhere, for (x, y) € D, we get f(x) = f(y) = f(aty) 
but f is obviously not additive on D’ = R. 

Questions similar to Erdés’s problem can be asked for other functional 
equations and inequalities. 
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Let J C R be an open interval. A function f : I — R is called almost 


convex if 
*(H4) <2 ((2,y) € Ix IM), 


where M C I x I is of planar Lebesgue measure zero. Kuczma |K 70] proved 
that, if f : I > R is almost convex, then there exists a unique convex function 
F:I-R such that F(x) = f(#) ae. in I. 

Ger [G 71, G 75] noticed that “equality almost everywhere” can also be 
introduced in an axiomatic way. 

A nonempty family Z* of subsets of the k-dimensional euclidean space 
R* is called a Linearly Independent Proper Ideal (abbreviated as LIPI) if 


) A, BE T* implies AUB €Z*, 
) AETI*,BCA implies B € T*, 
) For every real number a # 0, 8 € R*, and A € Z* we have aA+6 €T*. 
It can be easily seen that the family ye of all subsets of R® with Lebesgue 
measure zero is a LIPI. Similarly, the families F* and Lt of all sets of the 
first category in R* and of all sets having finite Lebesgue measure in R*, are 
also LIPIs. 

We say that two LIPIs Z? and TZ! are conjugate if for every M € T? there 
exists a set U € Z! such that, for every x € U, the set 


Vo = {y| (ey) © M} 
belongs to Z*. In view of Fubini’s theorem, the LIPIs £5, F?, £7 and Lo, F*, 
L} are, respectively, conjugate. 

Ger [G 71] proved the following result. Let Z? and Z' be conjugate linearly 
invariant proper ideals. If f : R > R satisfies the equation Names (0) = 0 
for all (x,y) € R*M where M € TZ? then there exists exactly one function 
F:R- Randa setU €T' such that Att! F(a) =0 holds for all (x,y) € R? 
(i.e., F' is a polynomial function) and F(x) = f(a) for alla ¢ U. 

Ger [G 75] found similar results for the Mikusitiski equation 


fle+ylf@+y) — f(a) — fy] =0 


and for the Pexider equation 


f(x+y) = g(x) + h(y) 


5. Some Applications 


The above extension theorems have important and amusing applications. 
An example is that of “aggregated allocations” see [AKNW 83, ANW 84, 
AW 81). 
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Suppose that there is a certain amount s of quantifiable goods (raw 
materials, energy, money for scientific projects, grants etc.) is to be allocated 
(completely) to m(> 3) projects. For this purpose a committee of n assessors 
is formed. The j-th assessor recommends that the amount 2z;; should be 
allocated to the j-th project. If he (she) can count, then en xij = s. Now 
the recommendations should be aggregated into a consensus allocation by 
a chairman or external authority. Again the allocated consensus amounts 
should add up to s. It is supposed (this is a restriction) that the aggregated 
allocation for the 7-th project depends only on the recommended allocations 
to that project, that both the individual and the aggregated allocations 
be non-negative (!) and, if all assessors recommend rejection, then the 
consensus allocation to that project should be 0 (‘consensus on rejection’). 
If we write the individual allocations for the j-th project as a vector x; = 
(1;,€2;,---,2njz) and denote the aggregated allocation by f;(x;) then these 
conditions mean the following: 


f; + [0,s]" > [0,5], f;,(0)=0 ( =1,2,...,m) 


and 

= =sl= > S° fi(xj) =s (13) 

j=l j=l 
where 1 = (1,1,...,1), sl = (s,8,...,8), and 0 = 01 = (0,0,...,0). With 
X1 = 81, xX. =... = Xm = O we get fi(s1) = s and, since the subscript 
1 has no privileged role, f;(s1) = s for all 7 = 1,2,...,m (‘consensus on 
overwhelming merit’). Now we substitute into (13) x1 = z = (21, 22,.--,2n), 
x3 = sl —Z, Xo =X4=...=Xm = 0: 

fi(z) =s— fs(sl—z) (2 € [0,s]”). 
The substitution of x} = x = (#1, Y2,...,%n), X2 = Y = (Yi, Y2,---, Yn); 
x3 = sl—x-—y, X4=...=Xm = 0 gives 
fix) + foly) =s— fa(sl-x-—y)=filx+y), (14) 


whenever x, y, xX + y are all in [0,s]". Putting here x = 0 we get fo(y) = 
f1(0) + fo(y) = fily) for all y € [0,s]". Again the subscripts 1,2 have no 
privileged role, so 


fi fo faa tin f on (0, s|” 
and (14) reduces to the Cauchy equation 
fx+y)=fx)+fly) (&y) €D), (15) 


where D := {(x,y) | x,y,x +y € [0,s]"}. All solutions of (15) can be 
extended from D (even though it is closed) to R” x R”. Applying the known 
result that every n-place additive function, nonnegative on an n-dimensional 
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interval, is of the form of an inner product (a,x) (a is a constant vector with 
nonnegative components a;), we have 


n n 
fj (21, 02,-.-, Ln) =o aim; (« > 0, Soa; = 1 = 12, : 
i=1 i=1 
We have found that the weighted arithmetic mean is the general solution of 
our allocation problem. 
Many applications concern local solutions of functional equations. The 
functional equation 


f(x +y— ry) + f(xy) _ f(x) ae f(y) (x,y €]0, 1{, [0, 1], or R) 


was introduced by Hosszi. The most complicated case is the one where f : 
]0, 1/4 R and the equation holds for x,y €]0,1[. The general solution is 
(Lajk6 [La 74]) 


f(x) = A(x) +b (a €J0,1[) 


where A : R > R is an arbitrary additive function (on R?) and 6 is an 
arbitrary constant. The proof is based on the fact that the function 7 —> 
f(a@+1/2) — f(1/2) is additive on the “triangle-like” set 


1 1 
1 <¥< 5 + LI. 
A is obtained as the quasi-extension of this function. 

The extension theorem for the Levi-Civité equation (together with the 
extension theorem of Dardéczy-Losoncai) can be applied to the solution of 
functional equations of sum form (see [L 85a, L 91)). 

Kiesewetter [Ki 65] proved that the “arctan equation” 


foziwer ( 


1 1 
D={(a,y)|—-5<2<O5+ 


a+y 
1—sxy 


) (z,y €R, 1- ay £0) (16) 


has no continuous solutions except f(x) = 0 (a € R). Crstici, Muntean and 
Vornicescu [CMV 83] found the local solutions of the arctan equation valid on 
the set Dy := {(x,y) € R? | zy < 1} satisfying various regularity conditions 
(continuity at one point or boundedness or measurability on an interval etc.). 
The general local solution on D, was found by Muntean and Vornicescu 
[MV 83]. 

The second author proved [L 85b, L 90] that the general solution of (16) 
is the function 


f(x) = A(arctanz) (# € R) 


where A: R > R is an arbitrary additive function (on R?) periodic with 
period 7. 
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This clearly implies f(tanra) = 0 for all rational numbers r, explaining 
Kiesewetter’s result. In [L 85b, L 90, MV 83] again the extension theorem of 
Daréczy and Losonczi was the main tool of the proof. 

Finally we mention that the result of de Bruijn and Jurkat can be applied 
to find all bounded multiplicative linear functionals on the complex Banach 
algebra of Lebesgue integrable functions (under convolution) [A 80]. 
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1. Introduction 


1.1 


This paper will cover some details on Penrose tilings presented in lectures over 
the years but never published in print before. The main topics are: (i) the 
characterizability of Penrose tilings by means of a local rule that does not 
refer to arrows on the edges of the tiles, and (ii) the fact that the Ammann 
quasigrid of the inflation of a Penrose tiling is topologically equivalent to the 
pentagrid that generates the original tiling. 

The fact that any Penrose tiling is the topological dual of the Ammann 
quasigrid of the inflation was first noticed by Socolar and Steinhardt [9]. They 
presented it in the equivalent form that the topological dual of the Ammann 
quasigrid of a Penrose tiling is the deflation of that tiling. 

The Ammann quasigrid of the inflation of a Penrose tiling can also be 
defined as the union of the central lines of the stacks of that tiling. Therefore 
I refer to that union as the central grid of the tiling. It can be defined 
independently of the original notion of Ammann grid. Actually the definition 
of the central grid can also be given for other kinds of tilings where there is no 
obvious definition of an Ammann grid and no obvious definition of deflation. 

The paper is intended to be readable more or less independently of 
previous ones, at least in the sense that all relevant notions will be explained 
in the paper itself. 


1.2 


My geometric terminology will use the notion of shapes and 1-shapes 
in the plane. A shape is an equivalence class of figures under similarity 
transform. Similarity includes multiplications (with respect to a point), 
shifts and rotations, but no reflections with respect to a line. In terms of 
complex numbers, this similarity transform just means linear transformation. 
A 1-shape is defined similarly, but without multiplications. That means that 
similarity is replaced by congruence. In other words, figures with the same 
1-shape have the same shape and the same size. 
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Fig. 1 The 1-shapes V (thin rhomb) and W (thick rhomb) 
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Fig. 2 Penrose’s arrowed rhombs V, (thin) and Wz (thick) 


Throughout this paper I shall use the two 1-shapes V and W, pictured in 
Fig. 1. They are rhombs, with all edges having unit length. The acute angles 
of V are 36°, and those of W are 72°. V will be called the thin rhomb and 
W the thick rhomb. The word “rhomb” will always refer to a V or a W. 

The arrowed rhombs V, and W, (the subscript a stands for “arrowed” ) 
are obtained from V and W by putting single and double arrows on the edges 
in the way depicted in Fig. 2. They will be called the Penrose rhombs. 


1.3 


A Penrose tiling is a tiling of the plane by V,’s and W,’s with the property 
that two tiles always have either nothing, or a vertex, or a full edge in 
common; in the latter case they are called direct neighbors and it is required 
that along the common edge they have the same kind of arrow in the same 
direction. Figure 3 shows a fragment of such a tiling. 

Various procedures for obtaining all Penrose tilings are known, like 


(i) Penrose’s use of deflation, in combination with a (non-constructive) 
selection argument (see [4] for an exposition). 
(ii) The use of deflation, in combination with “updown-generation” (see [5]). 
iii) Forming the dual of a “pentagrid” and providing the arrows afterwards 
g g g 
(see [1]). 


In [5] the relation between (ii) and (iii) was studied in detail. 

Somewhat more complicated, but nevertheless interesting and promising, 
are ways to take the Ammann bars (see [7, 9]) as the basis of the Penrose 
tilings. 
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Fig. 3 A piece of a Penrose tiling 


1.4 


One of the topics to be treated in the present paper is the question how one 
can determine whether a tiling of the plane by V’s and W’s can be provided 
with arrows so as to form a Penrose tiling with V,’s and W,’s. It will be 
shown (Sect. 2) that this can be settled by inspecting, for every rhomb in the 
unarrowed tiling, the figure formed by its four direct neighbors. 

Section 2 will pay attention to the question of a tiling give by the vertices 
only. 


2. Arrow-Free Characterization of Penrose Tilings 


2.1 


The term unarrowed rhomb tiling will mean a tiling of the plane with V’s and 
W’s, with the condition that neighboring tiles have a full edge in common. 

Any Penrose tiling turns into an unarrowed rhomb tiling by omitting the 
arrows on the edges. 

For any unarrowed rhomb tiling, any way to attach an arrow to every 
edge such that it becomes a Penrose tiling, will be called an arrowing of the 
tiling, and whenever such an arrowing exists the tiling is called arrowable. 
Not every unarrowed rhomb tiling is arrowable. A simple counterexample is 
the doubly periodic tiling by thick rhombs only. 

It will be shown that the condition for an unarrowed rhomb tiling to be 
arrowable can be put into a form that does not refer to arrows any more. 
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2.2 


Consider a figure formed by an unarrowed rhomb ¢ and four neighboring 
rhombs fj, tz, t3, t4, each one of these having one edge in common with t. It 
is required that there is no overlap between any two of these rhombs (apart 
from possible common edges). Let me call such a figure a cross. 

If to a cross formed by t, t1, te, ts, t4 one adds a number of further 
rhombs ts,...,t, which all have a vertex in common with t, such that there 
is no overlap between any two of t,t ;,...,t,%, and such that the areas around 
the vertices of t are entirely covered, then that figure will be called an extended 
Cross. 


2.3 


An arrowing of a cross or an extended cross is a way to attach an arrow 
(either single or double) to every edge in the figure such that each one of its 
rhombs becomes a Penrose rhomb. 

Not every cross can be arrowed. And a cross that can be arrowed is not 
necessarily extendable to an extended cross that can be arrowed. 

A cross will be called perfect if it is extendable to an extended cross that 
permits an arrowing. 


24 


It is not hard to make a list of all possibilities for perfect crosses. Starting 
with a thin rhomb ft, trying all possible sets of direct neighbors t1, ta, ts, ta, 
and investigating whether they have at least one arrowable extension, one 
finds 6 possibilities, pairwise related by 180° rotation, leading to 3 different 
1-shapes P,, P2, P3. Similarly, if starting from a thick rhomb, one gets to 
8 possibilities, which are pairwise related by 180° rotation, so there are 4 
different 1-shapes Py, Ps, Ps, Pr. 

For the notion “l-shape of a perfect cross” I shall also use the term 
neighborhood pattern. Figure 4 gives them all. 

P, and P, form a pair, in the sense that the mirror image of P, has the 
same 1-shape as P,. In the same sense Ps; and Ps form a pair. The others, 
P;, Py and P7, are symmetric: the axis of symmetry is the short diagonal of 
t in P3 and the long diagonal of ¢ in Py and P7. 


2.5 


Each one of the seven neighborhood patterns can be arrowed in exactly one 
way. These arrowed patterns are called P,,,..., P7q, and are shown in Fig. 5. 


Remarks on Penrose Tilings 465 


P 
va Nae 
f 


te 


Fig. 4 The neighborhood patterns P,,..., P7 


2.6 


At this stage it becomes possible to answer the question which unarrowed 
rhomb tilings can be arrowed. 

I shall say that an unarrowed rhomb tiling satisfies the cross condition if 
for each one of its rhombs the cross formed by that rhomb and its four direct 
neighbors is one of the seven 1-shapes P;,..., Py. 


Theorem 1. (i) If an unarrowed rhomb tiling of the plane is arrowable 
then it satisfies the cross condition. 
(it) If an unarrowed rhomb tiling of the plane satisfies the cross condition 
then it can be arrowed in exactly one way. 


Proof. (i) This is little more than the fact that the list of neighborhood 
patterns provided in Sect.2 is exhaustive. If a cross is a part of an 
arrowable rhomb tiling, then it also has an extension to an extended 
cross that lies in that tiling, and any arrowing of the whole rhomb tiling 
implies an arrowing of that extended cross. 

(ii) Start from an unarrowed rhomb tiling that satisfies the cross condition. 
So if t is one of the rhombs in the tiling then it is the central rhomb 
of a cross of one of the types P,,...,P;, and that uniquely defines an 
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arrowing of t according to Fig. 5. So the arrows on t are imposed by the 
cross of t. They will be called the imposed arrows of t. 

In this way an arrowing is prescribed for every rhomb in the tiling, but 
it is the question whether neighbors get the same imposed arrow on their 
common edge. To that end it has to be shown that if in the tiling two rhombs 
s, t have an edge in common, then along that edge the arrow of s (imposed 
by the cross of s) is the same as the arrow of t (imposed by the cross of t). 
This can be checked without having the full tiling available. 

If one of s, t is thin and the other one is thick, then uniqueness of arrowing 
holds already for the figure consisting of that s and t¢ only. That arrowing 
on that figure is imposed by the cross of s as well as by the one of t, so in 
particular the imposed arrow on the common edge is the same in both cases. 

The next case is that s and t are both thin. The cases where the cross of 
a thin rhomb contains a second thin rhomb are the P; and P> of Fig. 4. In 
both cases s and ¢t have a thick rhomb u as a common neighbor in the way 
depicted in Fig.6a (possibly with s and ¢ interchanged). Just by observing 
the pair s, u it is seen that the cross of s imposes a double arrow on the 
common edge of s and t, pointing to the right. With the pair t, u the same 
result is obtained for the imposed arrow of t. 
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Fig. 6 The arrowing of s imposed by the cross of s and the arrowing of t imposed 
by the cross of t lead to the same arrow on the common edge 


The remaining case is that both s and ¢ are thick. This is shown in Fig. 6b 
(possibly with s and ¢ interchanged). It is a part of a tiling in which the cross 
condition holds for both s and t. Since the cross of s contains ¢t it cannot 
contain a thick rhomb u pasted to the edge BC: such a configuration formed 
by s, t, u does not occur in any of the crosses of s shown in Fig. 4. The same 
argument shows that the tiling does not contain a thick rhomb pasted to BD. 

The only possibilities to paste thin rhombs to BC and BD are shown in 
Fig. 6c, d. 

In Fig. 6c, the only possible arrowing of the pair s, wu has a double arrow 
from B to A, whence that double arrow is imposed by the cross of s. The 
same argument applied to the pair t, u leads to the same imposed arrow of t. 

In Fig.6d the pair t, u shows that the imposed arrow of t on AB is a 
single one, pointing to the right. Inspection of the pair s, v shows the same 
imposed arrow of s on AB. 

This completes the proof of the theorem. 


2.7 


As a corollary of the theorem of Sect.2 it can be noted that an unarrowed 
rhomb tiling can be arrowed in at most one way. This does not require 
the whole theorem: uniqueness of arrowing holds already for all arrowable 
extended crosses. 
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2.8 


Every arrowable tiling contains infinitely many copies of each one of 
P,,...,P7. Inspection of an arbitrary Penrose tiling shows occurrences of all 
these perfect crosses, and it is known that any finite configuration occurring 
in any Penrose tiling occurs infinitely often in any other one. 


2.9 


An unarrowed rhomb tiling is completely determined by its set of vertices, 
irrespective of whether the tiling is arrowable or not. In order to see this, 
it suffices to check that in an unarrowed rhomb tiling any two vertices have 
distance 1 if and only if they are connected by an edge of a rhomb of the 
tiling. 

It is also easy to see that two vertices in an unarrowed rhomb tiling have 
distance between 0 and 1 if and only if they are the end-points of the short 
diagonal of a thin rhomb. 

The shortest distance exceeding 1 is 2 sin 36°(= 1.17557). It is the length 
of the short diagonal in a thick rhomb, but this distance may occur in a 
different way between vertices in a pair of adjacent thin rhombs. 


3. Deflation and Inflation 


3.1 


This section gives some information about inflation and deflation of Penrose 
tilings. It is intended as a kind of motivating background, but will not be 
used in the rest of the paper. 

Penrose’s idea of inflation and deflation was very central in the discovery 
of his arrowed rhomb tilings. Deflation is a certain way to get from a Penrose 
tiling to a new tiling with smaller pieces ($(—1 + V5) times the original 
ones), and inflation is the inverse operation, leading to a tiling with bigger 
pieces. For a description of the process see [1, 6-8]. It is somewhat easier 
to describe the deflation as an operation on rhombus halves. The half thin 
rhomb is obtained by cutting a thin rhomb along its short diagonal, the half 
thick rhomb by cutting a thick rhomb along its long diagonal. For these 
rhomb halves the deflation is a matter of subdivision (see [5]). This is shown 
in Fig. 7. 

In Sect. 2 of [5] it is explained that the idea of inflation comes naturally by 
observing how in an arrowed rhombus tiling pieces can be grouped together. 
Those groups form the pieces of the inflation. 
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Fig. 7 The deflation of the half thin and the half thick rhomb. The full figure on 
the left is a half thin rhomb, with arrows drawn at the outside. It is subdivided 
into a half thin and a half thick rhomb, with arrows indicated along the edges 
themselves. These two small pieces belong to the deflation. Similarly the full figure 
on the right is a half thick rhomb, and here the deflation has three pieces: a half 
thin rhomb and two half thick rhombs. There are of course two shapes of half thin 
rhombs, which are each other’s mirror image, and similarly there are two shapes 
of half thick rhombs. For the mirror images of the half rhombs shown in the figure 
one has to take the mirror image subdivision. 

When this subdivision operation is carried out for every half thin rhomb and every 
half thick rhomb of a Penrose tiling, the smaller pieces fit together to a Penrose 
tiling with smaller pieces 


3.2 


If deflation is applied to a tiling of a finite portion of the plane, and if the 
result is enlarged by a factor $(1 + V5) afterwards, it becomes a tiling of a 
bigger part of the plane with pieces of the original size. Infinite repetition of 
this process and application of an infinite selection principle leads, albeit in 
an inconstructive way, to Penrose tilings of the full plane (see [4, 6, 8]). But 
the idea of inflation and deflation can also be used in a completely different 
and very constructive way for the production of all infinite Penrose tilings. 
The method is updown generation, a process that is controlled by taking an 
arbitrary infinite path in a particular finite automaton (see [5]). 


4. Duality 


4.1 


The key method in [1] was the production of tilings by means of their topo- 
logical dual. I spend a few words here as a short independent introduction. 


4.2 


In any Penrose tiling the rhombs are arranged in what I shall call stacks. 
Consider an arbitrary edge e of an arbitrary rhomb r of the tiling. Two 
rhombs of the tiling are called e- neighbors if they have a common edge that 
is parallel to e. Two rhombs of the tiling are called e-related if they can be 
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connected by a chain of e-neighbors. The stack generated by r and e is the 
set of all rhombs which are e-related to r. 

Every rhomb of the tiling belongs to exactly two stacks, and any two 
stacks have exactly one rhomb in common, unless their e’s are parallel. 

The idea of a stack can at once be generalized to tilings by parallelotopes 
in higher dimensional spaces. In that case there can be stacks of various 
dimensions. In the two-dimensional case the stacks are one-dimensional, but 
in three dimensions there are two-dimensional stacks that can be considered 
as unions of one-dimensional stacks. 


4.3 


Consider a Penrose tiling and a stack generated by some r and e. In any 
rhomb of the stack connect the mid-points of the edges parallel to e by a 
straight line segment. These line segments form an infinite broken line that 
can be called the back bone of the stack. 

The union of the back bones of all stacks is called the skeleton of the tiling. 
In a skeleton one has curves (the back bones), points (the intersection points 
of the curves) and mazes (the connected components of the complement of 
the union of all curves). These curves, points and mazes can be deformed 
topologically without disturbing their relation to the tiling. On anyone of the 
curves the order of the intersection points with other curves keeps reflecting 
the order in which the corresponding stack is intersected by other stacks. 
The topological deformations do not disturb the duality between skeleton 
and tiling. In that duality curves correspond to stacks, points to rhombs, 
mazes to rhomb vertices. 


4.4 


If some topological deformation of a skeleton is given, one is not yet able to 
reconstruct the tiling. First one has to know, for any curve of the skeleton, 
the direction and the length of the edge e that was used in the discussion 
of the corresponding stack. Once all these are known, the Penrose tiling is 
determined up to a parallel shift in the plane. That is, the Penrose tiling 
deprived of its arrows. But the latter is no great loss: since one started from 
a Penrose tiling in the first place, one knows that a consistent system of 
arrows exists, and Sect. 2 takes care of the uniqueness of the arrowing. 


4.5 


In [1] it was shown that the skeletons are topologically equivalent to so-called 
pentagrids, but this had to be taken with a grain of salt in so-called singular 
cases. The pent agrids are parametrized by real numbers 7,...,y4 satisfying 
yo +... +74 = 0. If j is one of the numbers 0,...,4 and ¢ = e?7*/5 then 


Remarks on Penrose Tilings 471 


the set of all complex numbers z satisfying Re(z¢~/) + 7; € Z is a grid of 
parallel lines in the complex plane. The superposition of these five grids is 
called the pentagrid generated by yo,...,y4. The pentagrid is called singular 
if it contains three lines passing through one point. 

Key results of [1] are the following. Any non-singular pentagrid is the 
dual of a Penrose tiling. The singular pentagrids do not have a dual in the 
ordinary sense, but by infinitesimal perturbations they turn into grids that 
do have a dual. There can be various perturbations with different effect, and 
the result is that a singular grid corresponds either to 2 or to 10 different 
Penrose tilings. With these extra arrangements for the singular cases, all 
Penrose tilings are obtained from pentagrids. 

The perturbations will get some more attention in Sect. 6 below. 


4.6 


The idea of producing a tiling as a topological dual of the superposition of a 
number of parallel grids can be extended to general classes of tilings of spaces 
of arbitrary dimension by means of parallelotopes. See [2] for a proof that 
under fairly general conditions the dual covers the whole space uniquely. 


rey 


It can be concluded from Sect.4 that the skeleton of a Penrose tiling is 
topologically equivalent to some regular pentagrid or to some perturbed 
singular pentagrid. But remarkably, there is always a second and completely 
different straight line grid with the topological structure of the skeleton. This 
matter will be treated in Sects.5 and 6, where it is also shown that there are 
not just these two: there are infinitely many straight line grids with the same 
topology. The pentagrid can be deformed continuously in such a way that it 
always keeps the same topology and always remains a superposition of five 
straight line grids, with the central grid as final result. And one can even get 
beyond that. 


5. Ammann Bars and Central Bars 


5.1 


An attractive way to introduce the Ammann bars of a Penrose tiling is to 
consider both the thin and the thick arrowed rhomb as a billiard table and 
to choose particular billiard ball tracks on them. These tracks can already be 
shown on the rhomb halves, displayed in Fig. 8. The tracks have been chosen 
in such a way that when fitting pieces together the segments combine to 
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Fig. 8 The billiard ball tracks on the arrowed half rhombs. The angles at D, G, 
L and Q are 90°. The lines EF and FG form equal angles with AB, which is 
expressed by saying that the billiard ball bounces at the edge AB. At E, M, N 
and P there are similar cases of bouncing. Finally AD = HL, AF = HN and 
JP = AE = HM. In the mirror images of these half rhombs one of course takes 
the mirror image tracks 


infinite straight lines. These are called the Ammann bars after their discoverer 
R. Ammann (see [7, 9]). 

The grid formed by these bars, called Ammann quasigrid in [9], has 
some of the properties of the pentagrid used to produce the Penrose tiling 
in [1]. Both are superpositions of five parallel grids, making angles which are 
multiples of 72°. In the pentagrid the parallel grids are all equidistant, but 
in the Ammann quasigrid they show two different distances. On the other 
hand, the Ammann quasigrid looks much simpler than the pentagrid since 
its meshes show only a small finite number of shapes. A pentagrid contains 
infinitely many different mesh shapes, and arbitrarily small ones. 

It was observed by J.E.S. Socolar and P.J. Steinhardt (see [9]) that 
dualization of the Ammann quasigrid of a Penrose tiling leads exactly to 
the deflation of that tiling. 

A quite simple suggestive argument can be given for this statement, 
although it might not be easy to turn it into a formal proof. It is explained by 
Fig. 9. The straight line billiard track segments can be continuously deformed 
into the segments in Fig. 10. The points D, E,G, L, M, P can stay where they 
were, but F, N and Q move. In the mirror images of these half rhombs the 
corresponding things are done correspondingly. If in each rhomb of a Penrose 
tiling the original billiard ball track is deformed continuously into the new 
track, then the union of all segments shows the picture of a grid deforming 
itself by bending its bars, all the time keeping the same topology. In the final 
situation (the one of Fig.9) each mesh contains exactly one vertex of the 
deflation, and that is exactly the one produced by the dualization. 


5.2 


The Socolar-Steinhardt phenomenon can also be expressed like this: the 
Ammann bars of the inflation of a Penrose tiling form a quasigrid that is 


Remarks on Penrose Tilings 473 


HM a) K 


Fig. 9 The deformed billiard ball tracks on the half thin and the half thick rhomb. 
In the mirror images of these half rhombs mirror image tracks have to be taken. 
The vertices of the deflation in the above half rhombs are A, B,C, S, H, K, J,U,T. 
Having these tracks in all half rhombs of a Penrose tiling results in a set of mazes 
where each maze contains exactly one deflation vertex, and in the duality that 
vertex is exactly the vertex corresponding to that maze 


U R 
FY Cc 
V, 
S Q 
A EB PT = 


Fig. 10 The basic segments of the central bars. In the thin rhomb ABCD (with 
double arrows AB and C'B) the segments are EF and GH, in the thick rhomb PQRS 
(with double arrows QP and SP) they are TU and VW. There are right angles at E, 
G,T and V. The points F', H, U, W lie on the extensions of the segments CD, AD, 
RS, RQ, respectively. The lengths of the bar segments are EF = GH = sin36°, 
TU = VW =sin 72°. The positions are determined by HB = GB = TP = VP 
0.25. Moreover DF = DH = SU = QW = (—2+ V5)/4 = 0.059017 


the topological dual of that tiling itself. Let me call those Ammann bars of 
the inflation the central bars of the tiling itself. It will be shown that they can 
be defined independently of the billiard ball construction, and that a proof 
for the duality can be given without reference to the argument of Sect. 5. 

The name “central” was chosen because of the the central position of 
these bars in the stacks of the tiling (see Sect. 6). 

The central bars can be found by comparing Figs.7 and 8. The double 
arrows of the deflation are from S to A, from C to B, from T to H and from 
U to J. These are all intersected orthogonally by the billiard ball track. The 
distance from the intersection point to the end-point of the double arrow is 
just one fourth of the length of the arrow. This shows that the central bars of 
a Penrose tiling have to contain the segments EF, GH, TU, VW indicated 
in Fig. 10. 
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Fig. 11 The segments of the deformed bars EJ, GJ in the thin rhomb and TX, 
VY in the thick rhomb, determined by DI = DJ = SX = QY = 0.1. Any maze 
formed by these deformed bars contains exactly one vertex of the Penrose tiling, 
and that is exactly the vertex corresponding to the maze in the duality 


It has to be admitted that some of the rhombs contain further pieces of 
central bars, but in order to build the full central grid it suffices to consider 
those pieces of Fig. 10. Since 


EB=GB=TP=VP, CF =AH =RU=RW (1) 


it is obvious that in a stack the segments in the direction of the stack fit 
together to a full straight line. It is easy to show directly that (1) implies 
EB = GB = TP = VP = }, CF = AH = RU = RW = 3(2+ V5), and 
therefore the central bars can be introduced without any reference to the 
billiard ball tracks. 

The segments of Fig.10 can be transformed into those of Fig. 11 by 
continuous deformation. Taking the segments of Fig.11 in all rhombs of 
a Penrose tiling one gets something that is topologically equivalent to the 
skeleton of the tiling (that was defined in Sect. 4 by connecting mid-points of 
the edges). 

The transition from Figs. 10 to 11 does not affect the topology of the grid. 
This is not completely trivial, since (in contrast to what what was described 
in Sect.5) the operations take place partly outside the rhombs. So it has to 
be made sure that the corresponding operations in neighboring rhombs do 
not interfere. 

In Sect.6 the topological equivalence of pentagrid and central grid will 
be shown in quite a different way. 


6. Algebraic Proof of the Topological Equivalence 


6.1 


The various grids to be considered are generalizations of the one described in 
Sect. 4. They are characterized by real numbers w;,,, where j runs through the 
set {0,1,2,3,4} and & through the set Z of all integers. The grid determined 
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by these w’s is the superposition of I’, 9,...,T.,4, where each I’, ; is is a grid 
of parallel lines in the complex plane. The line with index k in l,, ; is the set 
of all complex numbers z for which Re(z¢~/) = w;,, (as before, ¢ = e?7*/5), 

So the pentagrid described in Sect.4 with its five real parameters 
Yo.-++, 74 Satisfying yo +... +74 = 0 is the case where w;;, = k — ;. 


6.2 


In [1] it was indicated how the pentagrid defines a Penrose tiling. Singular 
grids cannot be used directly: they first have to get an infinitesimal 
deformation in order to admit proper dualization (see [1], Sect. 12). 

A few words may be devoted to the meaning of the word “infinitesimal” 
here. If in a singular pentagrid a grid line, a vertical one, say, is moved over 
a small finite distance to the right, then triple intersection points may have 
been avoided in a big finite range only. For a given line in a pentagrid and 
a given large positive number R there exists a small positive ¢ such that 
within the circular disk given by |z| < R all triple intersections are avoided 
by shifting the line over a distance 6 with 0 < 6 < «. But outside the range 
R such shifts may cause other intersection points, and alter the topology. 
The topology of the grid obtained by an infinitesimal shift to the right can 
be defined as the limit of the topology obtained within |z| < R by means of 
sufficiently small shifts of the grid line. 

But of course, perturbation of a pentagrid is not just a perturbation 
of a single line, but of the set of the five y’s. I now introduce an index p 
that describes which infinitesimal perturbation has to be taken. It has ten 
possible values: 0,...,9, but in the majority of the singular cases there are 
only two that have a different effect, and in the regular cases all ten have 
the same effect. In [1], Sect.9, it was explained that the complex number 
gé= 5 7, ¢79 is an essential parameter for the Penrose tiling. It is related 
to the real numbers jz;, to be used in this section, given by 


Hy = 5 — (Vg-a t+ sa)T, 


for 7 = 0,...,4, where, as before, r = 2Re¢ = 3(-1+v5), and j is taken mod 
5 (so 75 = Yo, etc.). The p’s can be derived from € by Re(€¢~?7) = (1-57) 5. 

It was shown in [1], Sect. 11 that a pentagrid is singular if and only if for 
one of the j’s there is an a of the form (1—¢)(no +1 + nC? +1n3¢3 +4¢*) 
with integers no,...,n4 such that Re((€ —a)¢~/) = 0. It is not hard to show 
that this condition is equivalent to the following one: for at least one value 
of j there is an integer k such that (k — 1;)7 is an integer. 

Now consider infinitesimal perturbations of €, given by a positive infinites- 
imal dw and a perturbation parameter p (one of the values 0,...,9). The 
perturbation of € is d€ = e?™/>dw. And perturbations of the y’s that produce 
this d€ can be taken as (cf. [1], formula (9.2)): dy; = (2/5) Re(¢~*4 dé). 

The perturbations of the y’s turn out to be 
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1 nae 
du; = (1- sn) Re(e'?—49)4/5 day, 


6.3 


The lines of a pentagrid correspond to stacks of the Penrose tiling, and for 
each stack there is a central bar according to Sect.5. These central bars can 
be evaluated in terms of the y’s. This will be explained in Sect. 6. The result 
is that the line with index k in the j-th subgrid of the pentagrid leads to a 
central bar given by Re(z¢~/) = 8;,x, where, at least in the case of a non- 
singular grid, 8; = (1+ $77')(kK + 7[(« — py)t] — $7). ([z] is the usual 
notation for the least integer > x). In singular cases the value of [(« — p;)T| 
can be affected by perturbation of 1; if (k —;)7 is an integer. The effect can 
be described by means of the notations [x], and [a]_,intended as [a+ dw] 
and [a — dw], respectively, where dw is a positive infinitesimal. This just 
means 


[7]+=[#]+1, [2]-=[e] 


for all real values of x. With this notation the result for the central bar 
becomes 


Bie = (1+ 57 w+ rl — ws) Tlytewy — 57) (2) 


where y(p, 7) stands for + or — according to whether Re(e~49)77/) is < 0 
or > 0. 

In the non-singular cases the y(p,7j) can be ignored, of course. But it 
should be noted that it is not so easy to see from the y’s whether a case 
is singular or not. If one does not want to bother one might just take the 
(0,7) as a standard, but it is dangerous to omit the y(p,j) altogether. In 
the case yo = ++: = y4 = 0 with its ten different perturbations the formula 
83h = (1+ 477')(K+7[ KT] — $7) would definitely not represent the central 
bars of a Penrose tiling. 


6.4 


Here are some details about the derivation of the expression (2) for the central 
bar of the stack corresponding to a given line of a pentagrid. For simplicity, 
non-singularity will be assumed. Moreover, the (unessential) restriction is 
made that 7 = 0, which makes it possible to talk in terms of left and right. 
Moreover, the letter 7 becomes available for other purposes. 

So the grid line is (with some integer k) Re(z) = k — yo. The meshes 
directly to the left and directly to the right of this line produce the vertices 
of the rhombs of the stack. According to [1], Sect.5, the vertices are derived 
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from the meshes as follows. Take any point z in the mesh, and form the 
integers K;(z) = [Re(z¢~J) + 4,;]; then the vertex is 0 K,(z)¢?. With a 
real variable t the points of the grid line are represented as (k — yo) + it. So 
the vertices corresponding to the meshes directly to the left of the line are 
given by M(t) = k + S%_,[uj]¢7, where uj = Re(((k — 70) + it)¢-4) + 7j. 
It will be shown that the real part of M(t) takes only four different values 
if t varies. That real part is k + ([u:] + fus]) Re(¢) + ([ue] + [u3]) Re(¢?). 
Obviously [uz] + [ua] = [ui +4] +4, where q is either 0 or 1, and similarly 
[ug] + [us] = [ug +u3] +r, with r either 0 or 1. With the abbreviation 


A=[(k— yo) +1 + v4l7/2 — [—(& —0)7* +2 +9817 7/2 
this leads to 
Re(M(t)) =k +A+4+qRe(¢) +r Re(¢?). 


It gives the four different horizontal coordinates of the left end-points of the 
horizontal edges in the stack. The right end-points are obtained by adding 1. 
So all the vertices involved here are lying on eight vertical lines. The figure 
formed by those eight lines has the central bar as an axis of symmetry, and 
that is why the name “central” was chosen. Since Re(¢) + Re(¢?) = —4 the 
average of the eight horizontal coordinates is k + A+ i whence the central 
bar can be represented as Re(z) = k + A+ 4. 

From the arrowing of the horizontal vertices in the stack it is easy to 
derive that this central bar cuts the doubly arrowed horizontal edges at a 
point 4 from the end, exactly in accordance with Fig. 10. 

These calculations did not yet use the condition that 5°; = 0. Tilings 
without that condition were mentioned in [3] in connection with a riffle shuffle 
card trick, and it is actually the riffle shuffle arrangement in a stack that 
guarantees that there are only eight different horizontal coordinates. 

But >> y; = 0 gives a simplification: 


—(k—y0)7~* +72 +73 = Mot — kr —k, 


and since non-singularity is assumed, this cannot be an integer. If x is not 
an integer one has —[—2] = [a] —1, which leads to the following formula for 
the central bar: 


57). 


For the singular cases the result (2) can now be derived by obvious limit 
operations. 


Re(z) = (1+ ar t\(h + T[(k — po)T] 


6.5 


The topological equivalence of pentagrid and central grid will now be 
established directly, on the basis of (2). Figure 12 presents an illustration. It 
looks reasonable to enlarge the pentagrid by a factor 5/2 when comparing 
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Fig. 12 On the left there is a piece of the pentagrid with parameters yo = 0.2, 
v1 = 0.4, y2 = 0.3, 3 0.8, ya = —0.1, on the right a corresponding piece of the 
central grid with the same parameters. The scale of the picture on the left is 5/2 
times the one on the right. In order to facilitate the comparison of the topologies, 
the lines with k; = 0 in the j-th subgrid are thicker than the others 


it to the central grid (it is the same factor as in [1], Sect.5, formula above 
(5.3)), but for the topology of the grids such factors make no difference at all. 

The method is as follows. Consider two grids of the form of Sect. 6, given 
as Re(z¢~4) = aj, and Re(z¢~’) = 6; ,. It is assumed that for each j the 
aj,. and the 6;,, increase with k. And it is assumed that nowhere in the grids 
three lines pass through a point. In the topological correspondence the k-th 
line of the j-th subgrid of the a-grid will correspond to the k-th line of the 
j-th subgrid of the 6-grid, for all 7 and k. 

If p, g, r are straight lines forming a triangle, then they determine an 
orientation: seen from the inside of the triangle the circular order p, q, 
r is either clockwise or counter-clockwise. This gives the criterion for the 
topological equivalence in the grid: for any three lines in the a-grid the 
orientation has to be the same as for the corresponding three lines in the 
B-grid. 

There are two kinds of triangles here. One is of the kind formed with 
values 7, j —1, 7 +1, the other one with j, 7 +2, 7+3 (j taken mod 5). The 
orientation is a simple matter of determinants. The result is as follows. The 
topological equivalence of the two grids is guaranteed if for all 7 and for all 
integers p, qg, r the combination a;+1,, + @j;~-1,q — T@;,r has the same sign as 
Bj tip + Bj-1,¢ — TBjr, and Aj42,p + Aj-2,q +7 1Q;,r has the same sign as 
Bj+2 + Bj-2,q +7189," 

Since no three lines pass through a point none of these expressions is zero. 

It is this very explicit condition that has to be verified for the regular 
pentagrid and the corresponding central grid. It will be established in 
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Sects. 6-6 as a theorem on inequalities that can be understood independently 
of the previous sections. In order to get rid of the factor (1 + $7~') those 
sections work with 6 instead of 3, where 3 = (1+ 571 )0. And for simplicity, 
Sects. 6-6 restrict themselves to 7 = 0; the other cases are completely similar. 


6.6 


If the conditions mentioned in Sect. 6 are satisfied for a and § then they are 
obviously also satisfied for a and w(A), where 0 < \ < 1, and w(A) is obtained 
by linear interpolation: 


w(A) 5,4 = (1 = A)Q5,k + A Bj ,k- 


This means that the the a-grid can be deformed continuously into the 5-grid 
without ever changing the topology. It is even possible to push the beyond 1. 
The lower bound $7 in Theorem 2 (Sect. 6) has the effect that the topology 
of the w(A)-grid remains the same over the interval 0 < A < (1 — $(1+ 
1/-1),)-1 
aT yrs 

Section 4 discussed infinitesimal perturbations for the interpretation of 
the dual of a singular pentagrid. The same thing can now be achieved with 
the w(A), with small positive but not infinitesimal values of X. 


6.7 
The Sects. 6-6 do not make use of anything said in the previous sections. 
Let Yo,--., 7a be real numbers with yo +... + 74 = 0. For 7 = 0,...,4, 


k; € Z, the real numbers a;,, and 6;,, are defined by 


1 
arn =k- 7, Oj4 =k+7[(k—- w5)T] — x7, 


2 
where jij = yj — (1j-1 + Yj41)T7*, Y5 = 0 and 7 = 5(-1+ V5). 
Let ko,...,k4 be integers, and abbreviate 
H= 1 ky + A4,k4g — TQO,ko> L= O1,ky + 04 keg _— TO0, kos 


K = 2,by +. 03,b3 +77 'O10,ko, M = 82,kq + 93,k3 +77 *90,k0- 
With these abbreviations the following theorem can be proved: 


Theorem 2. If H £0 then L/H > $r. If K £0 then M/K > $7. 


6.8 


Here is the proof of the first part of the theorem. H and L can be expressed 
like this: 
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H=k,+ka—-—tko+7Tp0, 
L=kitka—tko +7[(ki — 1)t] + 7[ (ka — pa)t] — 77 (ko — wo)T] — 74 $7. 


If # and y are real numbers then [x] + [y] — [a+ y] is either 0 or 1. So 
[(ka — par] + [(ka — poa)r] = [ (ka + ha)r — (ua + pa)T] +p 
with p = 0 or p= 1. Moreover 
(ko — Ho)T = ki + kg — A, 


and since (uy + fu4)T = —LM0, 
(ka + ka)r — (ua + pa) = Hr + ko — a — ka. 


So there is a simple expression for L in terms of H and p: 


L=7/Hr "| -—7?/-H] —74 <7? + pr. 


Now first assume H > 0. Use p > 0, —[—H] > 0, and note that if0<c<1 
then [z] > 1—c+ er for all z > 0. The special case x = H7~', c = $7 leads 
to L/H > $r. 

Next assume H < 0. Use p < 1, [—H] > 1, and note that if0<c<1 
then [x] < cx +c for all  < 0. With ¢ = H7~!, c = $7 it follows that 
L/H > 5r. 


6.9 


The proof of the second part of the theorem is similar. K and M can be 
expressed as follows: K = ky + k3 + kot~' — Tyo and 


M = ko+k3+kor + +7[ (ko —pe)T] +7[ (ks — 3a)r] + [(ko — po)T] —7 
Note that 
[(ko — wa)t] + [ (ks — ws)T] = [(ko + ks)t — (ua + s)T] +4 


where gq is either 0 or 1. Moreover (ko — to)T = K — ko — ko — kz, and since 
Ha + 3 = [oT, 


(ke + k3)r = (pe + [3 )T — Kr = ko. 
So M can be expressed in terms of K and gq: 


1 

% 

If K > 0 it can be used that [K] > 1, whence M > r/Kr] —7 +3. If 
0<c<1 then [x] >1—c+ee for all ¢ > 0. With « = Kr, c= 577" this 
leads to M/K > 5T- If K < 0 one can use the inequalities g > 0, [K] < 0 
and [a] < ca+c (4 < 0,0 <c <1) with ¢ = Kr, c = }77*. This again 
leads to M/K > 37, and that finishes the proof of Theorem 2. 


M=r7/Kr|+[K]4+7q-7 
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Distances in Convex Polygons 


Peter Fishburn 


P. Fishburn (=) 
Lucent Technologies Bell Laboratories, Murray Hill, NJ 07974, USA 


Summary. One of Paul Erddés’s many continuing interests is distances between 
points in finite sets. We focus here on conjectures and results on intervertex dis- 
tances in convex polygons in the Euclidean plane. Two conjectures are highlighted. 
Let ¢(x) be the number of different distances from vertex x to the other vertices of a 
convex polygon C, let T(C’) = t(a), and take T,, = min{T(C) : C has n vertices}. 
The first conjecture is T, = (3). The second says that if T(C) = (3) for a convex 
n-gon, then the n-gon is regular if n is odd, and is what we refer to as bi-regular if 
n is even. The conjectures are confirmed for small n. 


1. Introduction 


Let n2 = |n/2], the integer part of n/2 for n > 3. We begin with three 
conjectures about every convex n-gon in R?. 


C1. Its vertices determine at least n2 different distances. 
C2. Some vertex has at least ng distances to the other vertices. 
C3. Each of at least nz vertices has at least ng distances to the other vertices. 


C1 was stated by Erdés in 1946 [3] and settled affirmatively by Altman in 
1963 [1]. C2, another old conjecture of Erdés, is open. C3 was suggested by 
recent work with Erdés. It might be false, but we have no evidence of this. 

My purpose here is to explore two conjectures related to C1—C3. We 
preface them with results for Cl and C2. Throughout, a convex polygon 
is unique up to similarity transformations (translation, uniform rescaling, 
rotation around a point, reflection around a line). R,, denotes the regular n- 
gon, and R,,41 — 1 is the n-gon whose vertices are n of the vertices of R,4+1. 
With d(z, y) the distance between x and y, a length-k run in a convex n-gon 
is a sequence Zp, %1,...,X£~ of successively adjacent vertices for which 


d(xo,21) < d(xo, x2) < +++ < d(x, ax). 


Theorem 1. C1 is true. Suppose convex n-gon C has exactly ng different 
intervertex distances. If n is odd then C = R,,. If n is even and n > 8 then 
Ce {Rn, Rn+i — 1}. 


Altman [1] proves all but the final sentence, which is proved in [8]. At 
n = 6, a third hexagon besides Rg and R7 — 1 has exactly three intervertex 
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distances (Ag, defined below); at n = 4, two other quadrilaterals besides R4 
and Rs — 1 have exactly two intervertex distances. 

The next theorem, from [6], gives the best result known toward C2. It 
implies that some vertex has at least n/3 different distances to the others. It 
is not known if there is A > 1 such that every convex n-gon (n > 4) has a 
vertex with at least |An/3] distances to the other vertices. 


Theorem 2. For every n> 4, every convex n-gon has a run of length |(n+ 
3)/3], and some convex n-gons have no run of length |(n + 3)/3] +1. 


The proof uses the following observation in Moser [12]. 


Lemma 1 (Moser’s). Every conver n-gon has a circumscribed circle with 
a sub-semicircular arc ending in vertices such that the region enclosed by 
this arc and the chord between its endpoints contains at least |(n — 1)/3| 
other vertices. Beginning from either endpoint, these vertices produce a run 
of length at least |(n + 2)/3]. 


Figure 1 illustrates Moser’s lemma and some special polygons, including 
the bi-regular polygons Ay, Ag and Ag. 

Convex 2k-gon Azz is called bi-regular because it is formed from two 
regular polygons. A, is composed of two equilateral triangles with a common 
side. To form Ap; for k > 3, begin with a fixed copy R? of R; with maximum 
intervertex distance 1. On each ray from the center of R? that bisects a side, 
add a vertex that has maximum distance 1 to those of R?. The vertices of 
Ao, are the k of R? and the added k on the perpendicular bisectors of the 
sides of R. 

Let t(2) be the number of distances from vertex x to the other vertices 
of a convex n-gon C. The t-sequence of C, which we view cyclically, 
is (t(@1),t(v2),...,t(@n)) with %1,22,...,% the vertices in clockwise or 
counterclockwise succession. Let T(C) = t(x1) + t(v2) +--+: +t(ay). We 
focus on 


Tn = min{T(C) : C is a convex n-gon}. 


For odd n, T(Rn) = n{(n — 1)/2] = (5) with t-sequence ((n — 1)/2,...,(n— 
1)/2). For even n, 


R, has t-sequence (n/2,n/2,...,n/2); 
A, has t-sequence (n/2 — 1,n/2,n/2—1,n/2,...,n/2—1,n/2), 


so 
T(Rp) = n2/2 and T(An) = (5) <n?/2. 


Since we have no intimations of smaller values for T(C), we consider two 
conjectures. The first was introduced in [4]. 
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Moser's lemma 
forn=13 


C1. 7, = () forall m2 3. 
C5. For all n > 3, if T(C) = (3) for a convex n-gon C, then C = R,, if n is 
odd, and C = A, if n is even. 


There is no apparent implication between these conjectures. If C4 is true, 
polygons other than R, and A, might realize T,,. If C5 holds, there might be 
other polygons that have T(C) < (}). Because C4 implies that the minimum 
average t(x) is (n — 1)/2, it strengthens C2 but is apparently independent of 
C3. And C5 does not obviously imply either C2 or C3. 

Suppose the preceding conjectures are all true. When n is odd, R, is the 
unique realizer for T,, and, by Theorem 1, for the minimum nz of Cl. But it 
clearly does not minimize the number of vertices with t(x) > nz in regard to 
C3 for n > 5. When n is even and n > 8, the realizers R, and R,+41 — 1 for 
ng in Cl differ from the unique realizer A, for T,,. Moreover, since A, has 
exactly n/2 vertices with t(x) > n/2, it gives the tightest possible realization 
of C3. 
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Evidence for C4 and C5 is provided in the next two sections. Section 2 
sketches proofs which show that C4 is true for n < 8, and Sect.3 outlines 
proofs which verify C5 for n < 7. Although some of the proofs are case- 
intensive, it is hoped that their ideas will encourage refinements or extensions 
that will settle the conjectures fully. 

Section 4 offers further remarks on C2—C5, then discusses a set of distance 
problems for convex polygons motivated by the conjecture of Erdés and Moser 
which says that the maximum number of pairs of vertices {x, y} in a convex 
n-gon for which d(x, y) = 1 is bounded above by cn for some constant c. 


2. Evidence for C4 


Theorem 3. C4 is true for n < 8. 


Since t(a) > 1 for all x, T3 > 3, and T3 = 3 is uniquely realized by Rs. 
When t(z) = 1 for some x, the other n — 1 vertices lie on a subsemicircular 
arc with center at x. Equal spacing along this arc yields the following lemma. 


Lemma 2. Suppose t(x) = 1. If y is adjacent to x for convex n-gon C then 
t(y) > n—2. Moreover, 


T(C) > (3n? — 8n +9)/4 if n is odd; 


T(C) > (3n? — 8n + 8)/4 if n is even. 


It follows that the minimizing t-sequence for n = 4 is (1,2,1,2). Thus 
T, = 6, which is uniquely realized by Ag. 

For n > 5, Lemma 2 implies that T(C) > (5) when t(x) = 1 for some 
vertex, so we assume henceforth that min t(x) > 2. The minimizing t-sequence 
at n = 5 is then (2,2,2,2,2), with T; = 10 realized by Rs. The next section 
shows that Rs is the only pentagon for which T(C) = 10. 

The next lemma will be used to verify Tg = 15. 


Lemma 38. /f x and y are adjacent vertices of convex n-gon C and t(x) = 
t(y) = 2, thenn <5. 


Proof. Let the hypotheses of the lemma hold. Assume without loss of 
generality that x = (0,0), y = (1,0), and the other vertices of C lie above 
the abscissa. We have d(x, y) = 1. Let d, and d, be the second intervertex 
distances for x and y respectively. Since the other vertices lie at intersection 
points of the x-centered circles of radii 1 and d, and the y-centered circles 
of radii 1 and dy, n < 6. To obtain n = 6, all four intersections above the 
abscissa must occur, so assume this. Then max{d,, dy} < 2. 

It is easily seen that convexity is violated at n = 6 if either d, = dy or 
min{d,,d,} <1. The latter is illustrated in Fig. 2a. Assume henceforth that 
1<d, < dy < 2 as pictured in Fig. 2b. Point g has d(x, q) = d(y,q) = 1. The 
other three intersection points are p, z and v. 
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Tangent 
- atq 


x y x 72 y 
(0,0) (1,0) 0 1 
d<1<d, 1<d,<d,<2 


Fig. 2 t(x) =t(y) =2 


We prove that convexity is violated at n = 6 by showing that q lies below 
line segment vp. With d, fixed, the best chance for convexity occurs when 
dy is near 2 [z near (—1, 0)] since this moves v down the d, circle. We prove 
that vp intersects the perpendicular bisector | of xy above q when dy, = 2. 

Fix dy = 2 and let d = dz. Let s be p’s horizontal component. Since 
cos@ = (1 + d? —1)/2d = d/2 and cos@ = s/d, s = d?/2. Then p’s vertical 
component is [d? — (d?/2)?]1/?. Similar computations for v give 


p = (d”/2,[a? — (a? /2)"}"/?) 
v = (-(3—a’)/2, [@ — {(3-d?)/2}7]””). 


Line segment ap+(1—a)v has first component a(d?/2)—(1—a)(3—d?)/2, 
which equals 1/2 for intersection with | when a = (4 — d?)/3. Let h be the 
height at which ap + (1 —a)v intersects L. Then 


h = {d(4 — d?)9/? + (a? — 1)[4d? — (3 — d?)?]*/*}/6. 
Since g = (1/2, 3/2), our claim that vp intersects 1 above q is 
GaP )P4re@ =e = Ger r= 3y3. 


Let u = d? —1,0 <u <3. Then, after squaring both sides of the preceding 
inequality, it reduces to 


(3 — u)3/2/8 + Tu — u2 > (9 — 8u+ u?) Ju. 


This holds when u > 2 since its right side is then negative. Suppose u < 2. 
We square sides and cancel identical terms to get 216—27u > 81u, ie., u < 2, 
so the inequality holds for all 0 < u < 3. 


It follows from mint(#) > 2 and Lemma 3 that the smallest possible 
T(C) at n = 6 occurs uniquely for the t-sequence (2,3, 2,3, 2,3). Since this 
is realized by Ag, Tg = 15. 


488 Peter Fishburn 


x y 


Increase 
dy 


Fig. 3 t(x) =3, t(y) =2 


Consider n = 7. Since R7 has t-sequence (3, 3,...,3), T7 < 21. If adjacent 
vertices never have t-counts of 2 and 3 but ¢(x) = 2 for some x, then T > 22, as 
with (2,4, 2,4,3,3,4). Hence T < 20 is conceivable only if there are adjacent 
vertices with t(2) = 3 and t(y) = 2. These t-counts are possible at n = 7, but 
only under special circumstances. 


Lemma 4. Suppose t(z) = 3 and t(y) = 2 for adjacent vertices of C with 
d(x,y) =1. Let d, < dz be the second and third distances for x; let dy be the 
second distance for y. Thenn <7, andn < 6 if min{d),d,} <1. 


Proof (outline). Lemma 3 requires n < 5 if we omit the third x distance. 
Since t(y) = 2, at most two more vertices are added by «’s third distance, 
son < 7. If dy < 1, straightforward geometric arguments yield a convexity 
violation if n > 6. If dy > 1 but d; < 1, convexity also forces n < 6. When 
1 < dy for this case, we use the argument in the latter part of the proof of 
Lemma 3. 


Figure 3 illustrates 1 < min{d,,d,} for Lemma 4. The top diagram has 
all possible circle intersections (n < 8). As drawn, g must be removed for 
convexity at n = 7, but p could be removed instead if d, were a bit smaller. 
By increasing d, we lose intersection point p, but can still have a convex 
heptagon with the seven remaining points. 
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Modulo minor changes in d;, dz and d,, Fig. 3 shows the only ways we 
can have a convex heptagon when there are contiguous t-counts 2 and 3. To 
have T < 20, the five vertices besides x and y must have a t sum no greater 
than 15, or an average of t < 3 per vertex. It is easily checked that no other 
vertex has t = 2, and some clearly have t > 4, so in fact T(C) > 22 for 
these heptagons. We conclude that T7 = 21, which is attainable only with 
t-sequence (3, 3,...,3). 

Finally, consider n = 8 where Ag with t-sequence (3, 4, 3, 4, 3, 4, 3, 4) gives 
Tg < 28. By Lemmas 2 and 3, the minimum sum of adjacent t-counts for a 
convex octagon is 6, with (t(x), t(y)) either (4,2) or (3,3). In the first case, 
addition of a ds; circle centered at x on Fig.3 will give a T(C) for n = 8 at 
least in the low 30’s. A similar analysis for (3, 3) yields a similar result. I omit 
details. It follows that Tg = 28. 

At the present time I know of no convex octagon for which 28 < T(C) < 
32. We have T(Rg) = 32, but Rg is not the only octagon with T = 32. An 
example is O; on Fig. 6 in [9]. Its t-sequence is (2,5, 4,5, 2,5, 4,5). 


3. Evidence for C5 


Theorem 4. C5 is true for n < 7. 


The preceding section verifies this for n < 4 and —— for n € {5,6,7} 


that the unique t-sequences that have T(C) = (3) ar 
(2, 2, 2, 2,2) forn=5 
(2,3,2,3,2,3) forn=6 
(3, 3, 3, 3,3, 3,3) for n = 7. 


These t-sequences are assumed throughout this section. We also let m denote 
the number of different intervertex distances in a convex polygon. 

Suppose n = 5. If m = 2, we have Rs by Theorem 1. If m = 3, Theorem 2 
and Fig. 2 in [9] show that ¢(x) > 3 for some vertex. Suppose m = 4. Denote 
the four distances by d; through d4 and label each vertex with its two 
distances. Suppose without loss of generality that one is labeled d,dz and 
another d,d3. Then each of the other three vertices has (d; or dz) and (d, or 
ds), and to obtain a fourth distance one of these has label did4, so each of 
the remaining two also has (d; or d4). Then, because t(x) = 2 for all vertices, 
all five have label d,, and this yields the contradiction that one of the other 
labels is used on only one vertex. A similar contradiction holds when m > 4, 
so Rs is the only pentagon with t-sequence (2,2, 2, 2,2). 

Let n = 6. Label the vertices 1 through 6 in succession clockwise and 
let jk denote d(j,k). By Theorem 2 there is a run of length 3. Assume 
for definiteness that 12 = a < 13 = b < 14 = c: see Fig.4a, where 
circled numbers give the t-sequence. By Moser’s construction [12] there is 
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a © b ® c 
VEO 


a<b<c Infeasible Feasible 
246 = Moser's A 


Fig. 4 n=6, (t(1),...,t(6)) = (3, 2, 3, 2, 3, 2) 


a minimum-diameter circumscribed circle that contains three vertices that 
give a triangle with each interior angle no greater than 7/2: see Figs. 4b, c. If 
one of the resulting subsemicircular sectors contains two of the other vertices, 
its end vertices both have t > 3, a contradiction to the t-sequence pattern 
(2,3,2,3,2,3). We therefore have the arrangement of Fig. 4c, where all three 
vertices of the noted triangle have either t = 2 or t = 3. If they have t = 3, we 
get a contradiction, for each of other three vertices has t = 2, which would 
force us back to Fig.4b or to the conclusion that some vertex lies outside 
the circle. As a consequence, Moser’s construction implies the arrangement 
shown in Fig. 4a. 

Because the sectors outside the sides of Moser’s triangle are subsemicir- 
cular, 21 < 26 and 23 < 24. Since t(2) = 2, we have 23 = 21 = a and 
26 = 24. By analogy, 43 = 45, 42 = 46, 65 = 61 and 64 = 62. Since 
26 = 24 = 46, Moser’s triangle is equilateral: see Fig. 4d. Let d = 26. We 
have 43 = 45 < d = 42 = 46, c = 41 and t(4) = 2. A simple geometric 
argument implies that c = d: if c = 43 with 1 as pictured, 3 would lie outside 
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the circle. This brings us to Fig. 4e. For reasons similar to those just given for 
c =d, we have 25 = 63 = c. Therefore triangles 124 and 326 are congruent, 
so the line through 1 and 3 is parallel to the line through 6 and 4, and as a 
consequence a’ = a*. Similarly, a’ = a, so all sides of the hexagon have the 
same length. These and the six c diagonals define Ag. 

My proof that R7 is the only convex heptagon C with t-sequence 
(3, 3,...,3) is unreasonably long. A shorter proof is needed. The main steps 
in mine are as follows: 


1. Show that C = R; if all sides are equally long. 
2. Prove that d(x,z) > max{d(x,y),d(y,z)} whenever x, y and z are 
consecutive vertices. 


Label the points 1 through 7 clockwise in such a way that Moser’s lemma for 
n = 7 gives 1 through 4 in a subsemicircular sector of a circumscribed circle. 
Set 12=a<138=b<l4=c. 


3. Prove that 17 = a, 16= band 15=c. 
4. Apply the result just proved in a sequence of steps (the first is 43 < 
42 < 41 => 45 = 43, 42 = 46, 41 = 47 = cc) to conclude that C = R7. 


The main labor involves eliminating other possibilities in step 3. 


4. Discussion 


Conjecture C4, which claims that T;, equals ae evolved from work on C2, and 
in turn suggested conjectures C3 and C5. It has been featured here because 
its appealing form and global character may suggest approaches that prove 
it as well as its parent C2. 

We noted earlier that, in view of A,, C3 is formulated as tightly as 
possible for even n. This is not true for odd n. At least four vertices of a 
convex pentagon have t > 2, and at least five vertices of a convex heptagon 
have t > 3. A small challenge is to give a convincingly tight alternative to C3 
for odd n. 

Another set of conjectures for distances in convex polygons is based 
on multiplicity vectors. The multiplicity vector of n-gon C is r(C) = 
(r1(C), r2(C),...,T%m(C)) where m is the number of different intervertex 
distances and r;(C) is the number of times the ith most-frequent distance 
occurs, ties resolved arbitrarily. Thus r1(C) > re(C) > --- > rm(C) > 1 and 
De ri(C) = (3). 

Let rj(n) = max{r;(C) : C is a convex n-gon}. Erdés and Moser [7] 
conjecture that r;(n) < cn for some constant c. The best general bounds 
we are aware of are 2n — 7 < r(n) < mn(2log.n — 1), due to Edelsbrunner 
and Hajnal [2] and Fiiredi [11] respectively. 
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It is known [5] that ro(n) = n for 5 < n < 8 and that r2(25) > 25. We 
do not know the smallest n at which the second most-frequent distance can 
exceed n, nor do we have a very good idea of the growth rate of r2(n)/n. 

It is not known if r3(n) > n for some n. 

We conjecture [4, 5] that ©[r;(C)]? is maximized uniquely over all convex 
n-gons at C = R,,, except for n € {4,6,8}. The nonregular maximizers for 
the exceptional cases are identified in [9]. 

Many years ago Danzer (see [1]) disproved an Erdés conjecture [3] by 
constructing a convex 9-gon in which each vertex has three others equidistant 
from it. Fishburn and Reeds [10] constructs a convex 20-gon in which each 
vertex has three others distance 1 from it. Erdés and Fishburn [4] conjecture 
that there is no convex n-gon in which every vertex has distance 1 to four 
other vertices. If true, then r1(n) < 3n — 6. 
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In the last 6 years, several combinatorics problems have been solved in an 
unexpected way using high degree polynomials. The most well-known of these 
problems is the distinct distance problem in the plane. In [Erdés46], Erdés 
asked what is the smallest number of distinct distances determined by n 
points in the plane. He noted that a square grid determines ~ n(logn)~!/? 
distinct distances, and he conjectured that this is sharp up to constant factors. 
Recently, an estimate was proven that is sharp up to logarithmic factors. 


Theorem 1 ([Guth—Katz11], building on [Elekes—Sharir10]). For any 
n point set in the plane, the number of distinct distances is > cn(logn)~t. 


The main new thing in the proof is the use of high-degree polynomials. 
This new technique first appeared in Dvir’s paper [Dvir09], which solved 
the finite field Nikodym and Kakeya problems. Experts had considered 
these problems very difficult, but the proof was essentially one page long. 
The method has had several other applications. The joints problem was 
resolved in [Guth—Katz10]. The argument was simplified and generalized in 
[KSS10] and [Quilodrdn10], leading to another one page proof. A higher- 
dimensional generalization of the Szemerédi-Trotter theorem was proven in 
[Solymosi-Tao12]. And several fundamental theorems in incidence geometry 
were reproved in the paper [KMS12]. 

The new trick in these proofs can be summarized as follows. We want 
to understand some finite set S in a vector space. We consider a minimal 
degree (non-zero) polynomial that vanishes on the set S. Then we use 
this polynomial to study the problem. This strategy is somewhat surprising 
because the statements of the problems often involve only points and lines. 
The joints problem and the finite field Nikodym problem can be solved in a 
page each using high degree polynomials but seem very difficult to solve 
without polynomials. Why polynomials play such a crucial role in these 
problems is somewhat mysterious. 

The point of this essay is to explain how these new methods work and to 
reflect on them philosophically. The main theme is the connection between 
combinatorics and algebra (polynomials). 

Here is an outline of the essay. 
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We begin by giving two detailed examples of the polynomial method: the 
finite field Nikodym problem and the joints problem. This is the subject of 
Sect. 1: Examples of the polynomial method. 

Once we’ve seen a couple examples of this method, we’re going to work 
on understanding “where it comes from”. In Sect.2, we discuss where the 
method comes from historically. We discuss related arguments from other 
areas of mathematics. Polynomials are fundamental mathematical objects, 
and there are many different perspectives about them. Section 2 is called 
‘Perspectives on polynomials’. We will see perspectives about polynomials 
coming from number theory, coding theory, and differential geometry. Each 
of these perspectives helps to understand why polynomials are useful in these 
combinatorial problems. 

In Sect. 3, we describe the new results in incidence geometry proven with 
polynomials, and we put them in perspective in the field. We recall the 
Szemerédi-Trotter theorem—a central result in the field—and discuss why 
the problem is difficult. We discuss one of the important methods in the 
field—the cutting method of [CEGSW90]. The Szemerédi-Trotter theorem 
involves lines in the plane. More generally, it is interesting to try to study 
k-dimensional objects in n-dimensional space. There are new challenges in 
higher dimensions. In particular, there is a new difficulty in dealing with 
objects of codimension > 1, such as lines in R® or 2-planes in R*. We take 
some time to explain why this type of problem is hard to understand 
using previous methods. The distinct distance problem appears at first sight 
(and second and third...) as a problem about circles in the plane, but Elekes 
found a way to rephrase it as a problem about curves in three dimensions. 
In particular, we will meet two theorems about lines in R? that are closely 
connected to the distinct distance problem and that illustrate the difficulties 
of incidence geometry in codimension > 1. 

In Sect. 4, we explain how polynomials can be used to study incidence 
geometry. Section 4 is called ‘Combinatorial structure and algebraic struc- 
ture’. We will explain the main ideas in the proofs of the two theorems at the 
end of Sect.3. More broadly, we will try to explain the mechanisms why a 
configuration with a lot of combinatorial structure is forced to have a special 
polynomial structure. 

This essay is for a volume on the mathematics of Paul Erd6és. Erdés’s 
ideas influenced the work we describe in many ways. He posed the distinct 
distance problem in [Erdés46]. This paper was one of the first papers in 
incidence geometry, perhaps the first, and the problem has shaped many 
ideas in the field. I am a big admirer of hard problems that are simple to 
state. The most exciting—in my opinion—is a simply stated problem that is 
hard for a new reason. I think Erdés’s distance problems are such problems. 
They helped create and guide a whole field of math. Mathematicians working 
in incidence geometry have made a great effort to clarify the nature of the 
difficulty of these problems, and then to find methods to deal with these 
difficulties. We describe here one chapter of this story. 
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1. Examples of the Polynomial Method 


Because some of the arguments are so short, I think the best introduction to 
the polynomial method is to look at some proofs. We give detailed sketches 
of two proofs, and then we will step back and talk about them. 


1.1 The Main Ingredients 


There are two basic facts about polynomials which are the main ingredients 
in the arguments. If F is a field, let Polyp(F”) be the space of polynomials 
over F with degree < D and n variables. Polyp(F”) is a vector space over F. 


Proposition 1.1. The vector space Polyp(F") has dimension C2") > 
D"/n. 


Proof. A basis is given by the monomials pee ...,0P" with Dy ++.) + 


D,, < D. By the ‘stars and stripes’ argument, the number of monomials 
: D+n 
is) 


As a corollary, we can estimate the degree of a polynomial that vanishes 
at prescribed points. 


Corollary 1.2 (Parameter counting). If S C F” is a finite set, then there 
is a non-zero polynomial that vanishes on S' with degree < n|S|!/". 


In rough terms, when we choose a polynomial in Polyp(F”), we have 
(@ 7) parameters at our disposal. As long as (a) > S, we have enough 
parameters to arrange a non-zero polynomial that vanishes at every point of 


S. Linear algebra makes this heuristic rigorous. 


Proof. We let Fen($,F) be the vector space of functions from S$ to F. 
Restricting polynomials to the set S gives a (linear) map Polyp(F") > 
Fen(S,F). There is a non-zero polynomial of degree < D vanishing on S if 
and only if this linear map has a non-trivial kernel. As long as the dimension 
of the domain is bigger than the dimension of the range, the linear map does 
have a non-trivial kernel. The dimension of the domain is CG pay and the 
dimension of the range is |.S|. By a brief computation, we can always choose 
D<n|S|¥" so that (?**) > |S]. 


The second main fact is that a non-zero polynomial in one variable 
cannot have more zeroes than its degree. A little more generally, we have 
the following. 


Lemma 1.3 (Vanishing lemma). Jf L is a line in a vector space and P is 
a polynomial of degree < D, and if P vanishes at D+ 1 points of L, then P 
vanishes on L. 


With little more than these tools, we will solve two hard problems about 
how lines intersect in vector spaces. 
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1.2 The Nikodym Problem in Finite Fields 


Let F, be a finite field with g elements. A set N C F@ is called a Nikodym set 
if for each point x € F/, there is a line L so that L \ {x} C N. The question 
is, “how big does a Nikodym set need to be?” The paper [Dvir09] proves that 
a Nikodym set needs to have at least c,q” elements—it needs to contain a 
definite fraction of the points in F7. 


The history. The problem above is a finite-field adaptation for a problem 
in Euclidean geometry. A set N C [0,1]” is called a Nikodym set if for each 
x € [0,1]”, there is a line L so that N contains LM [0,1]" \ {x}. The main 
question is “how big does a Nikodym set need to be?” Nikodym proved in the 
20s that there are Nikodym sets of measure 0. The Nikodym conjecture says 
that the (Hausdorff or Minkowski) dimension of a Nikodym set is always n. 
(This roughly means that the 6 neighborhood of a Nikodym set must contain 
nearly d~” 6-boxes.) 

The Nikodym conjecture is a major open question in harmonic analysis. 
From our brief description, it’s not at all clear why the problem is con- 
sidered important. The Nikodym problem turns out to have connections to 
fundamental problems in Fourier analysis and PDE, including the restriction 
problem. The restriction problem was raised by Stein in the 1960s, and it 
has played a major role in Fourier analysis ever since then. The Nikodym 
conjecture is a close cousin of the more well-known Kakeya conjecture. 
The connection between these geometrical questions and problems in Fourier 
analysis and PDE is described in [Laba08] and [Tao01]. 

Mathematicians have put a lot of effort into the Nikodym and Kakeya 
problems but remain far from a complete solution. Because the problems 
seem difficult, analysts have begun working on a variety of cousins and 
model problems that may shed some light back on the original problems. 
In [Wolff99], Wolff proposed looking at the finite field analogues of these 
questions. Proving that the Minkowski dimension of a Nikodym set in [0, 1]” is 
at least a is analogous to proving that a Nikodym set in Fj has Z g* elements. 
In particular, Dvir’s theorem is analogous to the Nikodym conjecture. 


The proof of the finite field Nikodym conjecture. Let us assume that 
N CF; is a Nikodym set with < (10n)~"g” elements. Let P be a non-zero 
polynomial that vanishes on N with minimal degree. 


1. By parameter counting, the degree of P is < n|N|!/" < q—1. 

2. By the vanishing lemma, P(x) = 0 at every point x € F}. To see this, 
consider the line LZ given by the definition of the Nikodym set. We know 
that « € L and that |LM N| > q-—1. So P vanishes on q — 1 points of L, 
and since deg(P) < q—1, P must vanish on all of L. 

3. Once we know that P vanishes at every point (and that deg(P) < q—1), 
it’s not hard to show that all the coefficients of P are zero. In other words, 
P is the zero polynomial and we have a contradiction. 
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The Kakeya problem. The Nikodym problem is a close cousin of the more 
well-known Kakeya problem. A Kakeya set in R” is a set containing a unit 
line segment in each direction. The Kakeya conjecture says that any Kakeya 
set in R” must have dimension n. A Kakeya set in F7 is a set containing a 
line “in every direction”. More precisely, a Kakeya set contains a translate 
of any line in Fj. By a small modification of the argument above, [Dvir09] 
proves that any Kakeya set in F} contains > c(n)q” points. 


The influence. This proof shocked the harmonic analysis community. 
Analysts exchange stories about where they were when they heard about it. 
In [Erdés], Erdés told a story about how hard it is to judge the difficulty of 
a problem. This is the most dramatic example that I have personally 
encountered. The Nikodym and Kakeya and restriction problems are closely 
connected, notoriously difficult problems of analysis. I believe that the finite 
field version was considered roughly as difficult as the original version until 
it was proven in one page. (To be fair, I should also say that the finite field 
version was only open for about 10 years, and it was much less studied than 
the original problem.) 

After the shock, people tried to adapt the new method to the original 
Nikodym and Kakeya problems in Euclidean space. So far, not much has 
been proven this way. It remains to be seen whether these methods will lead 
to progress in harmonic analysis. But the polynomial method has had a lot 
of influence in combinatorics. In this section we give one more example: the 
joints problem. 


1.3 The Joints Problem 


Suppose that £ is a set of L lines in R”. (The case n = 3 is a good case 
to keep in mind.) A joint is a point that lies in n lines of £ with linearly 
independent tangent directions. In other words, if the lines of £ thru x do 
not all lie in a hyperplane, then x is a joint. The problem is, how many 
joints can we make with LD lines? The joints theorem says that the number of 
joints is < L™-1, This number is sharp up to constant factors. For example, 
consider S hyperplanes in general position. Any n — 1 hyperplanes intersect 
in a line, giving L = Ge lines. Any n hyperplanes intersect in a point, and 
(6). of these points is a joint for our set of lines. So the number of joints is 
(°) ~ Lat, 


n 


The history. The joints problem was posed by Chazelle, Edelsbrunner, 
Guibas, Pollack, Seidel, Sharir, and Snoeyink in [CEGPSSS92]. They thought 
of the problem as a model problem for some difficult (still open) problems con- 
nected with computer vision. The original problem was in three dimensions. 
The best known bound before the polynomial method was that the number 
of joints is < L'6?, [Feldman-SharirS05]. The paper [Guth—Katz10] proved 
the joints conjecture in three dimensions using the polynomial method. 
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The papers [KSS10] and [Quilodrdn10] simplified the proof and generalized 
the result to any dimension. 


The proof. We will prove the following main lemma: In any arrangement of 
lines in R” with J joints, one of the lines contains < J‘/” joints. The theorem 
follows from this main lemma by elementary counting. Given LD lines and 
J joints, we remove the lines one at a time, using the main lemma to find an 
unpopular line to take out. Each time we remove a line, at most J!/" joints 
disappear. Therefore, J < LJ'/", and rearranging gives the theorem. 

To prove the main lemma, we let P be a non-zero polynomial of minimal 
degree that vanishes on all the joints. 


1. By parameter counting, the degree of P is < J\/”. 

2. If a line / contains > deg(P) joints, then P vanishes on the whole line. 
So it suffices to find a line 1 € L so that P is not identically zero on I. 

3. If P vanishes on all of the lines of £ going thru a joint x, then VP vanishes 
at x. This is because V P(x) vanishes in the direction tangent to each line, 
and the span of the tangent directions is all of R”. So if P vanishes on 
all the lines in £, then each partial derivative 0;P vanishes at each joint. 
We know that P is not constant, so one of these partial derivatives is non- 
zero, and it has degree < deg(P). This contradicts the definition of P as 
having minimal degree. 


The influence. Starting from these two proofs, this little trick with high 
degree polynomials has become a major tool in incidence geometry. It has 
helped resolve several old problems and led to new proofs and perspectives 
about fundamental theorems. We will discuss the resulting ideas in Sects. 3-4. 


1.4 Why Polynomials? 


The proofs of the finite field Nikodym conjecture and the joints conjecture 
feel like the “right” proofs to me because they are so short and because 
the problems seemed very difficult before. But the proofs still seem a little 
mysterious to me. These are questions about points and lines, and yet it 
seems to be crucially important to use high degree polynomials to understand 
them. Is it really much harder to prove these results without using high degree 
polynomials? If so, why are polynomials so connected with these problems? 
I have been thinking about these questions and discussing them with people 
for several years. In this essay, I will share the observations that I know. I still 
wish I understood the questions better. 

If we play around with questions about how lines intersect in R°, then we 
will come to an important example that involves a degree 2 algebraic surface. 
Let’s try a few questions, beginning very naively. If £ is a set of lines, an 
intersection point is a point that lies in at least two lines. 
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Question 1. Given L lines in R°, how many intersection points can 
there be? 


There can be at most G ) intersection points, since any two lines intersect 


at most once. This upper bound is sharp. If all the lines lie in a plane, and if 
they lie in general position within the plane, then there will be C) distinct 
intersection points. 

Perhaps a set of lines in space can have many intersection points only by 
clustering in a plane? We can probe this issue with the following question. 


Question 2. Suppose that £ is a set of L lines in R? with < 10 lines in any 
plane. How many intersection points can there be? 


Remarkably, there can still be ~ L? intersection points. Here we come 
to a crucial example involving a degree 2 algebraic surface. The surface is 
defined by the equation z = xy. This surface contains a lot of lines. For any 
number b € R, let Hy be the “horizontal” line (x,b,ba),2 € R. For any 
number a € R, let V, be the vertical line (a,y,ay),y € R. The horizontal 
lines and the vertical lines both lie in the surface z = xy. The horizontal 
line Hy and the vertical line V, intersect at (a,b, ab). Let £ consist of L/2 
horizontal lines and L/2 vertical lines. These lines intersect at L?/4 distinct 
points, so £ has > L? intersection points. The intersection of a plane with 
the surface z = ry is a degree 2 algebraic curve, and so it contains at most 
two lines. Therefore, any plane contains < 2 lines of £. 

(This degree 2 surface is an example of a regulus. Reguli play an important 
role in the approach to the joints problem in [CEGPSSS92].) 

Although Question 2 is about points and lines, the key examples do not 
just involve linear objects (lines, planes, etc.)—they also involve algebraic 
surfaces. This example gives one motivation why polynomials play a role in 
incidence problems about lines and points. 

Let’s follow our investigation a bit further. Lines may have many 
intersection points by clustering in a plane or in a degree 2 surface. Let’s 
forbid both types of clustering. 


Many Intersections Problem. Suppose that & is a set of L lines in R° 
with < 10 lines in any plane or degree 2 surface. How many intersection 
points can there be? 


This time, there will be far less than L? intersection points. The methods 
of [CEGPSSS92] show that the number of intersection points is < L5/%. Using 
the polynomial method, the paper [Guth—Katz11] shows that the number 
of intersection points is < L°/?, This estimate plays a role in the distinct 
distance estimate, and we will discuss it more later. 

The many intersection problem is a significant open problem. The best 
current upper bound on the number of intersection points is ~ L°/?. 
The examples I know all have < L intersection points. 
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We can get a little perspective on this problem by naive parameter 
counting. The set of lines in R° is a 4-dimensional manifold. If we choose 
L lines, we are choosing 4£ parameters. In fact, there is no real loss in 
generality in assuming that each line is given by a graph « = az+b,y = cz+d. 
So we can specify L lines by 42 real parameters aj,...,az,01,..., bz, ete. 
The condition that line 2 intersects line 7 can be described by one algebraic 
equation in the parameters a;,b;,c;,d;,a;,b;,¢;,d;. If we want our lines to 
have I intersection points, then we need to solve J equations in 4D variables. 
This naive parameter counting suggests that getting significantly more than 
40 intersection points requires some kind of structure or coincidence. A bit 
more rigorously, I believe that if we replace the set of lines by a “generic” 
4-parameter set of curves in R*, then no arrangement will have more than 
4L intersection points. 

Here is the philosophical question behind the many intersections problem. 
Morally, any arrangement with more than 4Z intersection points exists 
only because of some special structure in the set of lines. Now what 
special structures could the set of lines have? There is some structure from 
linear algebra. There is also some structure from polynomials and algebraic 
geometry. Are there any other ‘special structures’ of the set of lines in R?? 

In summary, some important examples in incidence geometry come from 
algebraic surfaces. It is interesting to ask whether all the examples come from 
algebraic surfaces. The polynomial method gives an approach to prove this 
type of statement in some cases. The main goal of Sect. 4 is to explain how 
this works. 


2. Perspectives on Polynomials 


In this section, we explore how this polynomial trick is connected to other 
parts of math. We will consider three other areas. The areas are diophantine 
problems in number theory, error-correcting codes in computer science, and 
surface area estimates in differential geometry. These areas give different 
perspectives on what makes polynomials special and useful functions. 


2.1 There Are Lots of Polynomials: Thue’s Work 
on Diophantine Approximation 


Let’s begin with a warmup problem. What is the smallest possible degree of 
a non-zero polynomial P € R{, y] that vanishes at the million points (j, 2) 
where j is an integer in the range [1, 10°]? 

The first approach one might try is to write down polynomials that 
vanish at the prescribed points. For example, we might try TS (a — j) or 


1A eae (y — 2’). Either of these options has degree 10°. We might try to craft a 


more clever formula that improves the degree. I don’t know how to write down 
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any explicit formula with degree < 10°. But the optimal degree is less than 
1,500. This follows by parameter counting, as in Sect. 1.1. The dimension of 
Poly, 49g (IR?) is ("3°°) > 10°, and so there is a non-zero polynomial of degree 
< 1,498 vanishing at all million points. This type of situation appeared in 
Thue’s work on diophantine equations and approximation. Here is Thue’s 
central result. 


Theorem 2.1 (Thue 1909). Suppose that 8 is an irrational algebraic 
number of degree d > 2. If p/q is any rational number, then 


|8 — p/q| > e(B)g7 2-11. 


As an immediate corollary, Thue proved that a huge class of diophantine 
equations in two variables have only finitely many integer solutions. For ex- 
ample, the following equations have only finitely many integer solutions. 


1. 2° — 2y3 =6. 
2. 24 + 1lay? + 17y* = 29. 
3. 2° + 2x7y3 + Gy? = 9. 


Thue’s corollary can be stated as follows: 


Corollary 2.2. If P(x,y) € Z|a,y] is a homogeneous polynomial of degree 
d > 3 which is irreducible, and if n is an integer, then the equation P(x, y) = 
n has only finitely many integer solutions (x,y) € Z?. 


Thue’s result was dramatically more general than any previous theorem 
about diophantine equations. To get a sense of how Thue’s diophantine 
approximation result implies the corollary, consider equation 1 above. 
Dividing through by y? we get (x/y)? — 2 = 6|y|~. This formula shows that 
a/y is a very good rational approximation of 2!/3. With a little manipulation, 
it follows that |2'/3 — (a/y)| < 100|y|~*. In contrast, Thue’s theorem on 
diophantine approximation says that |2'/? — (x/y)| > ely|~?>!. Comparing 
these inequalities, we see that |y| is uniformly bounded, and then it follows 
that there are only finitely many solutions. 

We will give a very partial sketch of Thue’s proof, and we will see how it 
connects with our warmup question about polynomials. 

Before Thue, the main theorem about diophantine approximation was 
Liouville’s theorem. 


Theorem 2.3 (Liouville 1840s). If 6 is an algebraic number of degree d > 1, 
and p/q is any rational number, then 


|B — p/q| > e(B)q~*. 


The idea of the proof is simple and we describe it here. By assumption, 
6 is a root of a degree d polynomial Q(x) € Z[z]. Since d is the minimal 
degree of such a polynomial, it’s not hard to check that Q(p/q) is non-zero. 
But Q(p/q) is a rational number with denominator gq“, and so |Q(p/q)| > q~@. 
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But Q(8) = 0, and since Q is minimal, it’s not hard to check that Q’(8) 4 0. 
So |Q(p/q)| has the same order of magnitude as |S — p/q|, and we see that 
|\8 — p/q| > e(8)q~¢. In rough terms, the polynomial Q “protects” 8 from 
rational approximations because Q(3) = 0 but Q(p/q) cannot be too small. 

Liouville’s theorem is not strong enough to prove finiteness for any 
diophantine equation. When d = 2, Liouville’s theorem is optimal, but for 
any d > 2, Thue was able to improve the exponent —d. Any improvement of 
the exponent in Liouville’s theorem implies the finiteness corollary. In other 
words, once we know that |8 — p/q| > c(8)q~“** for any € > 0, then Thue’s 
finiteness result follows. 

Thue had the idea to use other polynomials besides just Q in order 
to protect 6. Looking for other polynomials of one variable doesn’t turn 
up anything, but Thue had the remarkable idea to use polynomials of 
two variables. If P(x,y) € Zlx,y] is a polynomial of two variables that 
vanishes (maybe to high order) at (8,3), then P can “protect” 8 from 
pairs of good rational approximations (p;/q1,p2/q2). To prove that |2!'/3 — 
(x/y)| > cly|~?-°!, Thue requires an infinite sequence of auxiliary polynomials 
P;(x,y) € Z[z, y] which vanish at (2'/3, 2'/3) to different orders. Each of these 
polynomials protects (2!/3,2!/3) from rational approximations (p;/q1, p2/q2) 
in certain ranges, and working all together they provide enough protection 
to prove Thue’s theorem. 

Thue carefully by hand crafted this infinite sequence of polynomials 
P;(x,y). He was able to construct the desired polynomials by hand when 
B is a dth root of a rational number. He became stuck trying to generalize 
his method to other algebraic numbers, because he didn’t know how to 
construct the auxiliary polynomials. The problem of looking for these 
auxiliary polynomials is similar to our warmup problem. At a certain point, 
Thue gave up trying to craft the polynomials he needed. Instead, he proved 
that they must exist by counting parameters, essentially as we did above. 

At the 1974 ICM, Schmidt gave a lecture [Schmidt74] on Thue’s work 
and its influence in number theory. He wrote, 

The idea of asserting the existence of certain polynomials rather than 

explicitly constructing them is the essential new idea in Thue’s work. 

As Siegel [1970] points out, a study of Thue’s papers reveals that Thue 

first tried hard to construct the polynomials explicitly (and he actually 

could do so in case 8% is rational). 


This idea reminds me of the probabilistic method. Thue proved that his 
auxiliary polynomials exist using the pigeon-hole principle. No one knows 
how to give an explicit formula for these polynomials, but there are so many 
polynomials that some of them are guaranteed to work. 

Thue’s wonderful argument has many similarities to the proofs in Sect. 1. 
All the arguments have the following general outline. First, by counting 
parameters, we find a polynomial that vanishes at certain places. Second, we 
use basic facts about polynomials to understand what the polynomial does 
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at other places. Polynomials work in these arguments because they have a 
combination of rigidity and flexibility. Polynomials obey rigid properties like 
the vanishing lemma, which make them useful in the second step. On the 
other hand, there are lots of polynomials, which make them rather flexible in 
the first step. It’s somewhat remarkable that such a large space of functions 
obeys such rigid properties. 


2.2 The Resilience of Polynomials: Polynomials 
in Coding Theory 


The two main ingredients in the proofs of finite field Nikodym and joints 
are the parameter counting lemma and the vanishing lemma. This team of 
ingredients appeared together earlier in the theory of error-correcting codes. 
Dvir has a background in coding theory, and this circle of ideas may have 
influenced his proof of the finite field Nikodym conjecture. 

Let F, be a finite field with q elements, and let Poly p(F,) be the vector 
space of all polynomials over F, of degree < D. Because of the vanishing 
lemma, any two polynomials in Polyp(F,) can only agree at < D points. 
As long as D is much less than g, any two polynomials in Polyp(F,) look 
very different from each other. This makes them interesting tools for building 
error correcting codes. 

Here is a typical situation in coding theory. Q is a polynomial over Fy, 
of degree < q/1,000. We want to transmit or save Q, but the data gets 
corrupted, and instead we end up with a function fF : F, + Fy. Suppose we 
know that F(a) = Q(x) for at least (51/100)q values of x. Is it possible to 
recover Q from F'? 

It follows immediately from the vanishing lemma that @ can be recovered 
from F' in theory. Suppose that Q@, and Qs» are polynomials of degree < 
q/1,000 that agree with F for > (51/100)gq values of x. Then Q1 — Qz vanishes 
for at least (2/100)q values of x, and so Q1 — Qz is zero by the vanishing 
lemma. Hence there is only one polynomial Q € Poly,/; o99(Fq) consistent 
with the data F’. 

But there’s a deeper question that remains: can we recover Q from F' in 
a practical way? The argument above tells us that we can find Q by testing 
all the polynomials of degree < g/1,000—but the length of this procedure is 
more than exponential in g. In the mid-1980s, Berlekamp and Welch gave a 
polynomial-time algorithm to recover Q from F’ [BW86]. 

We consider the graph of F: the set {(a,y) € F7|F(x) = y}. This graph 
looks like a cloud of points. Inside the cloud of points a certain algebraic 
structure is hidden: most of the points lie on the graph of @. How can we 
search out this algebraic structure hidden in the cloud of points? 

The main idea of the algorithm is to find the lowest degree non-zero 
polynomial P(a,y) that vanishes on the graph of F. On the one-hand, we 
can find an optimal P with an efficient algorithm. On the other hand, this 
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optimal P uncovers the hidden algebraic structure in the cloud of points: 
looking at the zero set of P, the graph of Q jumps off the page. 

We begin by explaining how to find this optimal P. This discussion is 
closely connected to the parameter counting argument in Sect.1. Suppose 
we want to check whether there is a non-zero polynomial of degree < D 
that vanishes on a set S C F%. Let Polyp(F7) denote the space of all the 
polynomials with degree < D. Let Fcen(S,F,) be the vector space of all the 
functions from the set S to F,. This is a vector space of dimension |S]. Let 
R: Polyp(F7) — Fen(S,F,) be the restriction map which restricts each 
polynomial to the set S. The map R is a linear map between vector spaces, 
and it’s not hard to write down an explicit matrix for it. The basic operations 
of linear algebra can be done in polynomial time. We can check whether R has 
a non-trivial kernel, and if it does we can find a non-zero element in the 
kernel. Doing this for each degree D, we find in polynomial time a non-zero 
polynomial P that vanishes on the graph of F and has minimal degree. 

In the discussion so far, we treated the variables x and y on equal terms. 
Berlekamp and Welch actually treat them differently. This makes sense if we 
look back at the problem we’re trying to attack. We’re hoping to find the 
graph of Q, which is defined by y — Q(x) = 0. This defining equation has 
degree 1 in y and high degree in x. In order to adapt to the problem, it turns 
out to be a good idea to use polynomials P(x, y) of degree 1 in y and high 
degree in x. From now on we just consider polynomials P(x, y) = Po(x) + 
yP\ (x). By the same linear algebra argument, we can find such a polynomial 
P which vanishes on the graph of F', and where max(deg(P 5), deg(P,)) is as 
small as possible. 

We can also give an estimate for this degree. If we consider Po, P, of 
degree < D, then we get a vector space of polynomials of dimension 2D + 2. 
We want to find a polynomial that vanishes on the graph of F’, which has 
q points. As long as 2D + 2 > q, such a polynomial is guaranteed to exist by 
parameter counting. Therefore, we know that the degree of Po, P; is < q/2. 

Let’s summarize. We found a polynomial P(x, y) = Po(x) + yPi(y) which 
vanishes on the graph of F, where the degrees of Py and P, are as small 
as possible and definitely < g/2. This polynomial will help us to unlock the 
information hidden in F. 

The key point is that P vanishes on the graph of Q! This follows in a few 
simple steps. 


1. We know P = 0 on the graph of F’. In other words, P(x, F(x)) = 0 for all 
a: 

2. But we know that F usually agrees with Q. So P(x, Q(x)) = 0 for at least 
(51/100)gq values of x. 

3. But P(x, Q(x)) = Po(x) + Q(x)Pi(x) is a polynomial in x of degree < 
q/2 + q/1,000 < (51/100)q. 

4. By the vanishing lemma, P(x, Q(x)) is the zero polynomial! 
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We have proven that P(x, Q(x)) = Po(x) + Q(x)P, (2) is identically zero. 
Hence Q(x)P\(x) = —Po(a). We know Po and P;, and now we can recover Q 
by doing polynomial division. This is the Berlekamp—Welch algorithm. 

There is a more visual way of explaining how to recover Q, which makes 
the graph of Q jump off the page. We let the set of errors be FE := {x € 
F,|F (a) € Q(x)}. Adding a few more lines to the argument above, one can 
prove that the zero set of our polynomial P is the union of the graph of Q 
and a vertical line « = e at each error e € FE. Looking at the zero set of P, 
the set of errors is immediately visible, together with a large chunk of the 
graph of Q. From this large chunk of the graph of Q, we can quickly recover 
Q itself. 

Computer scientists working on error-correcting codes found a new set 
of questions about polynomials, very different from questions that pure 
mathematicians have considered. Working on these questions gave new 
perspectives about polynomials. Writing about coding theory in [Sudan95], 
Sudan referred to the resilience of polynomials: we can significantly distort 
the polynomial Q, but the information in Q survives. There is a lot more 
work on polynomials and coding theory. Some of it is described in [Sudan95] 
and in [Trevisan04]. The parameter counting lemma and the vanishing lemma 
continue to be important players. 


2.3 Efficiency of Polynomials: Polynomials in Geometry 


The last step of the proof of the distinct distance problem was influenced 
by ideas about polynomials in differential geometry. The overarching idea is 
that polynomials are geometrically efficient. 

We begin with an older result about the efficiency of complex polynomials. 
The zero sets of complex polynomials are minimal surfaces. Let’s formulate 
a precise result. We identify C” with R?” and equip it with the standard 
Euclidean metric. Let P be a complex polynomial C” — C. Let Z(P) denote 
the zero set of P. If the zero set of P does not contain any critical points of 
P, then Z(P) is a submanifold of real dimension 2n — 2. 


Theorem 2.4 ([Federer69]). Suppose that P : C"™ —+ C is a complex 
polynomial, and that F : R?” + R? is a smooth function, so that P = F 
outside of the unit ball B?” CR?” =C™. Also, suppose that Z(P) and Z(F) 
don’t contain any critical points, which implies that they are both manifolds. 
Then 


Volon—2 Z(P) AB < Volo, Z(F) 0 B™. 


This theorem says that complex algebraic surfaces do not waste any 
volume. 

In this section, we will be interested in analogous results for real polyno- 
mials. Initially, it seems that there can be no such result. The Weierstrauss 
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approximation theorem says that any continuous function on a compact 
subset of R” can be C° approximated by real polynomials. This basically 
means that real polynomials have no special properties at all. 

But if we slightly shift the question, there is an interesting theorem 
discovered only in the last 10 years. Instead of focusing on one polynomial 
at a time, we focus on the space Polyp(R”), the space of all polynomials of 
degree < D. Individual polynomials may be wasteful with volume, but we 
will see that the space Poly p5(R”) is efficient with volume. This follows from 
two results, one old and one new. 


Proposition 2.5. If P is a non-zero polynomial in Polyp(R”), then 
Voln_1 Z(P) NB" < C(n)D. 


This is a classical result. Because P is a degree D polynomial, a line 
can intersect Z(P) at most D times unless the whole line lies in Z(P). 
The Crofton formula describes how the volume of a hypersurface can be 
reconstructed in terms of the number of intersections between the surface 
and all of the lines in space. When we plug our estimate on the intersection 
numbers into the Crofton formula, it follows that the volume of Z(P)N B” 
is < C(n)D. 

Now comes the new result. Gromov compared Polyp(R”) with other 
vector spaces of the same dimension and saw that Polyp(R”) has approxi- 
mately the smallest zero sets. 


Theorem 2.6 ([Gromov03], see also [Guth09]). If W is a vector space of 
continuous functions B” + R, and if dimW = dim Polyp(R"”), then there 
exists F € W so that 


Vol,-1 Z(F) 9 B” > c(n)D. 


The proof uses a result from topology, but in some ways it is similar to 
the proof of finite field Nikodym or joints. A leading role is played by the fact 
that dim Polyp(R”) ~ D”. 

The contribution from topology is the Stone~Tukey ham sandwich 
theorem. The original ham sandwich theorem says that given three finite 
volume sets in R?, there is a plane that bisects all three. This theorem was 
proven by Banach in the late 1930s. Stone and Tukey generalized the result. 
For one thing, they generalized it to higher dimensions, but they did much 
more than that. They realized that the argument does not apply only to 
perfectly flat planes but also to many other families of surfaces. Stone and 
Tukey figured out the right way to formulate the theorem, making it much 
more general. The formulation is based on functions instead of hypersurfaces. 

We say that a continuous function F bisects a finite volume set U if the 
subset of U where F' > 0 has half the volume of U and the subset where 
F <0 has half the volume of U. 
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Theorem 2.7 (Stone-Tukey 1942). Suppose W is a vector space of continu- 
ous functions on a domain Q C R", so that for every non-zero F € W, Z(F) 
has measure 0. Let U,,...,Un CQ be finite volume sets, where N < dimW. 
Then there is a non-zero F € W which bisects each Uj. 


We can now sketch the proof of Gromov’s theorem. If there is a non-zero 
function F' € W so that Z(F) has positive (n-dimensional!) measure, then it 
has infinite (n — 1)-dimensional volume, and we are done. So we can assume 
that each Z(F’) has measure 0, and we can apply the Stone-Tukey ham 
sandwich theorem. Let U; be ~ D” disjoint balls in B” each of radius ~ D7. 
We choose a non-zero function Ff € W that bisects each ball. A classical 
result in geometry says that a surface bisecting a ball needs to have a certain 
minimal volume. In fact, the smallest bisecting surface is a disk through the 
center of the ball. 


Bisection lemma. I[f a hypersurface bisects B"(r), then it has volume at 
least c(n)r?—". 


Therefore, Z(F)NU; > c(n)D--, And Z(F)NB" > D?D-("-) = D. 
This finishes the sketch of Gromov’s estimate. 

These ideas from geometry/topology give a new twist to the polynomial 
method. Using linear algebra, we can find a non-zero polynomial P € 
Poly (R”) that vanishes on a set of points p1,...,pn as long as N < 
dim Poly,(R”). This fact plays a key role in the solutions of the finite 
field Nikodym problem and the joints problem. Now using the Stone—Tukey 
theorem from topology, we can find a non-zero polynomial P € Poly,(R”) 
that bisects some sets U),...,Un as long as N < dim Polyp)(R”). The proof 
of the distinct distance estimate uses this new twist. We will explain how to 
use it in Sect. 4. 


3. Some Methods and Problems in Incidence 
Geometry 


In this section, we describe the impact of the polynomial method in incidence 
geometry. We begin by recalling some important results and methods in the 
subject. Then we will come to the new applications of the polynomial method. 
We will try to motivate these results, and we will discuss why they are hard 
to prove with previous methods. 

This section motivates the results, and in the next section, we will discuss 
the proofs of these results. 


3.1 Incidence Theory in the Plane 


Suppose that £ is a set of lines in the plane. Let S,(£) be the set of r-rich 
points: the set of points that lie in > r lines of £. One of the basic questions 
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in the field is, “for a given number of lines and a given number of r, how 
big can S,(£) be?” This question was answered in a fundamental theorem of 
Szemerédi and Trotter. 


Theorem 3.1 ((ST83]). If £ is a set of L lines in the plane, then |S,(£)| S 
L?r-3 + Er. 


This theorem is a central result of incidence geometry. 
The first estimates about this problem exploit the following basic fact: 


Basic Fact. Two lines intersect in at most one point. 


Using just this fact and doing some counting arguments, we get some 
basic estimates. We call these estimates ‘basic’ because they follow just from 
the basic fact above. 


Basic estimate 1. |S,.(2)| < L?r7?. 


At each point of S,(£), there are C) pairs of lines intersecting. In total, 
there are only Cf pairs of lines, and each pair only intersects once. Therefore, 


[5-(2)| -< () em ~ L?r~?, Another short counting argument gives the 


following further estimate. 
Basic estimate 2. If r > 2L'/?, then |S,(£)| < L/r. 


These estimates are not as strong as the conclusion of the theorem. 
For example, if r = L'/?, then the theorem says that |S,(£)| < L!/?, but 
the basic estimates give only < L. 

There is a crucial example in the story showing that a proof of the 
Szemerédi—Trotter theorem requires some quite different ideas. The example 
involves lines over finite fields. Let F, denote the finite field with q elements. 
Let £ be the set of q? non-vertical lines y = ma +b, m,b € Fy. Each point of 
F? lies in q different lines of £. So we have |S,(£)| = q?. Since q = L'/?, we 
have |$,1/2(£)| = L. For L lines in R?, the Szemerédi—Trotter theorem gives 
the much better bound |$71/2(2)| < L'/?. Now it is still true in F? that two 
lines intersect in at most one point. Therefore, we cannot possibly prove the 
Szemerédi—Trotter theorem just by exploiting the fact that two lines intersect 
in at most one point. 

The main philosophical issue in the proof is to figure out what other 
information about lines in R? we can use. We need to use something that is 
true in R* but false in F7. There are several approaches to the problem, and 
in some way they all use the topology of the plane. 


3.2 The Cutting Method 


The cutting method was introduced by Clarkson, Edelsbrunner, Guibas, 
Sharir, and Welzl in [CEGSW90]. They used the method to give an elegant 
proof of the Szemerédi—Trotter theorem. They were also able to prove 
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incidence geometry results in higher dimensions. We will discuss this more 
below. Cutting plays a crucial role in the later applications of the polynomial 
method. 

We illustrate the cutting method by describing the main idea of the 
proof of the Szemerédi—Trotter theorem. The proof is a divide-and-conquer 
argument. We cut the plane into pieces using D red lines. Here D « L is 
a parameter we can play with, and the D red lines don’t have to be lines 
from £. The complement of the red lines consists of convex polygonal cells. 
The idea is that we use the basic estimates for the points and lines in each 
cell, and then sum up the pieces. This idea works well as long as the lines of 
£ and the points of $;.(£) are evenly distributed among the cells. 

Let’s be a little more precise about what we may hope for. The D red lines 
cut the plane into ~ D? cells. If the points were evenly distributed among 
the cells, we would have the following: 


Equidistribution 1. Each cell contains < |S,(£)|D~? points of S,.(£). 


Now a line may enter at most D+ 1 cells, because it can only cross each 
red line once. Since there are ~ D? cells, each line enters only a small fraction 
of the cells. If the lines were evenly distributed among the cells, we would 
have the following 


Equidistribution 2. Each open cell intersects < LD~! lines of £&. 


If we are allowed to choose any D, and find D red lines that evenly 
distribute $,.(£) and £, then using the basic estimates in each cell and adding 
the results we get the conclusion of the Szemerédi—Trotter theorem. In fact, 
we don’t need to evenly distribute both S,(£) and £—either one will suffice. 
We state this precisely as a proposition. 


Proposition 3.2. Let £ be a set of L lines in the plane and fiz some r. 
Let i =1 or 2. Suppose that for any1l< D<L, we can find D lines cutting 
the plane into ~ D? cells so that Equidistribution i holds. Then |S,(£)| < 
E?r-3 + Er, 


The proof of this result is just a calculation. When I first did this 
calculation, I thought I had understood the main idea of the proof of 
Szemerédi—Trotter. Getting the points or lines to evenly distribute among 
the cells seemed like a minor point to me. My wrong intuition went like 
this: if I just put down the dividing lines without thinking too much, then 
the points wouldn’t have a reason to concentrate in any particular cells, so 
they would probably end up pretty evenly distributed. With a little more 
experience, I think that this intuition was totally wrong. 

Here’s an alternate perspective. If I choose D red lines, then I have 2D 
real parameters at my disposal. I would like each of D? cells to contain ~ 
|.S;.(£)|/D? points of S,(£). Iam trying to satisfy ~ D? conditions. In essence, 
I have 2D variables, and I am hoping to solve D? equations. Without other 
information, this is a plan that sounds unlikely to work. 
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Here’s an example of a set of points which is impossible to equidistribute. 
Take any set of points lying on a closed convex curve in the plane. Each red 
line intersects the curve in at most 2 points. Therefore, D red lines cut the 
curve into < 2D pieces. It follows that most of the ~ D? cells contain no 
points of the set. 

This divide-and-conquer plan actually does work, driven by one further 
crucial idea. The crucial idea is to choose the D red lines independently at 
random from among the lines of £. If we do this, the lines of £ interact with 
the red lines in a good way, and we get something close to Equidistribution 2. 
We briefly give intuition why this may work. Suppose that we first randomly 
pick D/10 red lines from the lines of £ and look at the resulting cells. If one 
of these cells contains > 100L.D~' lines of £, then it is very likely that one of 
them will be chosen among the next D/10 red lines, and the cell will get cut 
into smaller pieces. Cells intersecting more than 100L.D~! lines have a brief 
half-life, and this suggests that at the end of the process almost all cells will 
intersect < LD~* lines of £. This gives (a bit of) the flavor of the random 
line argument. We have left out some important details. The cutting method 
involves some further care, and the random cutting needs to be refined a 
little. But choosing a random subset of D lines from £ is a crucial first step. 


3.3 Problems in Higher Dimensions 


Generalizations of the Szemerédi—Trotter theorem are a central subject of 
incidence geometry. One natural direction is to work in higher dimensions. 
Instead of lines in the plane, we can consider k-planes in R”. Some of the 
proofs of the Szemerédi—Trotter theorem are very planar, and it is difficult to 
generalize them to R” for n > 3. For example, [Székely97] gives a beautiful 
proof of the theorem using crossing numbers of graphs. This proof generalizes 
to a huge variety of problems in the plane, but it seems very difficult to 
generalize it to higher dimensions. The cutting method was invented partly 
in order to attack higher-dimensional problems. 

Let’s summarize how to adapt the method to higher dimensions. The gen- 
eral divide-and-conquer strategy still makes sense. To divide R” into cells, 
we need D red hyperplanes instead of D red lines. They divide R” into 
~ D” cells. If we have some kind of equidistribution, we still get interesting 
estimates. Moreover, if we are studying a set of (n — 1)-dimensional planes 
in R”, then we can randomly choose D hyperplanes from our set, and we 
get some type of equidistribution. The objects don’t necessarily have to 
be planes—we can also study codimension 1 spheres, paraboloids or other 
shapes. 

But if we are studying k-planes in R” for k < n—1, then there is a major 
difficulty: k-planes do not divide R” into cells. If we try to choose (n — 1)- 
planes so that the k-planes are equidistributed among the cells, we cannot 
use the key random trick above. We are stuck with ~ D parameters hoping 
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to satisfy ~ D” conditions. Moreover, there are examples of arrangements of 
k-planes in R” where no arrangement of hyperplanes gives equidistribution. 
These examples generalize the set of points on a convex curve described 
above. 

In summary, there is a major obstacle in dealing with objects of 
codimension > 1. The joints problem is one of the simplest incidence problems 
in codimension > 1. That’s one reason the joints problem is interesting 
and important. Following the joints theorem, it looks reasonable to use the 
polynomial method to attack other incidence problems in codimension > 1. 
We will see a number of results in this direction. 

Before the polynomial method, I only know of one sharp estimate about 
incidences in codimension > 1. This is Toth’s complex generalization of the 
Szemerédi-Trotter theorem [Toth03]. If £ is a set of L complex lines in C?, 
Toth proved that |S,.(£)| < L?r~3+ Lr~!—the same estimate as for real lines 
in R?. From the point of view of topology, C? is homeomorphic to R* and 
the complex lines are homeomorphic to R?, and so in a topological sense the 
codimension is 2. Toth’s proof is adapted from the first proof of Szemerédi 
and Trotter, and it is technically difficult. 

In his work on the complex problem, Toth raised the following question. 
Suppose that £ is a set of k-planes in R?*, and that any two k-planes 
of £ intersect in < 1 point. (In other words, we forbid two k-planes to 
contain a common line.) Is it still true that the number of r-rich points 
is < L?r~3 + Lr-!. This is a bold higher-dimensional generalization of 
the Szemerédi-Trotter theorem (and it also includes the complex version of 
the Szemerédi-Trotter theorem). Recently, Solymosi and Tao proved Toth’s 
conjecture up to a factor of L* using the polynomial method. 


Theorem 3.3 ([Solymosi—Taol12]). If £ is a set of L k-planes in R?*, and 
if any two planes of & intersect in < 1 point, then for any € > 0, the number 
of r-rich points of & is < C(6)L*(L?r—3 + Lr). 


3.4 Distance Problems in the Plane 


There are many deep open problems in incidence geometry even for curves in 
the plane. One example is the unit distance problem (which Erdés’s posed 
in [Erdés46] alongside the distinct distance problem). It asks, given n points 
in the plane, how many pairs of points can have distance 1? In all known 
examples, the number of unit distances is < n!**. (In a square grid with 
a well-chosen spacing, the number of unit distances is slightly superlinear, 
but < n't for any « > 0.) The paper [SST] gives the best currently 
known bound: the number of unit distances is < n4/3. This bound is closely 
connected with the Szemerédi—Trotter theorem. The unit distance problem is 
analogous to the Szemerédi—Trotter problem with unit circles in place of lines. 

The reason for the difficulty seems to be the following. If we replace unit 
circles by “unit parabolas” (parabolas of the form y = x? + ax +b), then the 
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bound n4/3 is tight. To improve the n*/? bound, we need to find and use a 
property that is true for unit circles and false for unit parabolas. There’s no 
clear candidate for this property or how to use it. 

The distinct distance problem can also be phrased as a problem about 
circles in the plane, and it is difficult for similar reasons. 

Elekes found a completely different way of thinking about the distinct 
distance problem, connecting it to problems in higher codimension like the 
ones we discussed in the last section. 


3.5 Partial Symmetries 


Suppose G is a group acting on a space X. If P C X is a finite set, then we 
can look at the symmetries of P under the group action. We define 


G(P) := {g € G such that g(P) = P}. 


Elekes started a study of partial symmetries. A partial symmetry of P is 
a group element that maps a large chunk of P to another large chunk of P. 
More precisely we define the r-rich partial symmetries by 


G,(P) := {9 € G such that |g(P)M P| > r}. 


It’s interesting to try to understand the size and structure of G,(P) 
in different situations. Elekes realized that this natural problem is closely 
connected to the distinct distance problem and to the incidence geometry of 
curves in 3-dimensional space. In these connections, the group G is the group 
of orientation-preserving rigid motions of the plane. 


Conjecture 3.4 ((Elekes-Sharirl0]). If P is a finite set in the plane, and 
r > 2, then 


IG-(P)| S|PPr*. 


(If P is a square grid, then this bound is tight up to a constant factor for 
all2 <r <|P|/10.) 

Elekes and Sharir proved this conjecture for r = 3 using the polynomial 
method. Nets Katz and I proved the conjecture in [Guth—Katz1]]. 

This conjecture is closely related to the distinct distance problem. Elekes 
realized that if a set P has few distinct distances, then it must have lots of 
partial symmetries. We sketch the reason. Let Q(P) be the set of distance 
quadruples, defined as follows. 


Q(P) = {(p1, U1, p2, q2)|dist(p1, q1) = dist(p2, q2)}. 


If there are few distinct distances, then it stands to reason that there 
will be many pairs of points at the same distance. By a Cauchy—Schwarz 
argument, one gets |d(P)||Q(P)| = |P|*, where d(P) is the number of distinct 
distances of the set P. So if there are few distinct distances, then |Q(P)| will 
be large. 
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On the other hand, each quadruple in Q(P) suggests a partial symmetry 
of P. For each quadruple of Q(P), there is a unique rigid motion g € G so 
that g(pi) = p2 and g(qi) = q. The rigid motion takes two points of P to 
two other points of P, so it belongs to Go(P). In this way, we get a map 
E : Q(P) > G2(P). We want to use this map to count Q(P). If the map E 
were injective, we would have |Q(P)| < |G2(P)|, which in turn is < |P|?. The 
map F is actually not injective. If |g(P) P| =r, then the preimage E~'(g) 
has size ~ r?, because there are (5) pairs of points in g(P)NP, and each pair 
yields a distance quadruple. Based on this observation, it’s straightforward 
to relate Q(P) and G,(P): 


|P| 


JQP) ~ YoriG-(P)I. 


r=2 


Plugging in the Elekes—Sharir conjecture gives |Q(P)| < yt |P|er-t w~ 
|P|? log|P|, and so |d(P)| = |P|/log|P|. So the Elekes-Sharir conjecture 
implies the new bound for the distinct distance problem. 

The next observation of Elekes is that understanding the size of |G,.(P)| is 
an incidence geometry problem where the background is the group G instead 
of Euclidean space. Instead of lines in R*, we consider the following special 
curves in G. For any two points p,, p2 € R?, define 


Sip. = {9 € G such that g(p1) = pe}. 


These curves are natural objects from the point of view of the group structure 
of G. The curves Sp, », are 1-dimesional subgroups of G, and the curves Sp, jp, 
are their cosets. 

For a finite set P C R?, let S(P) denote the |P|? curves {$,, ».bp,p.eP- 
Next, we observe that a group element g is in G,(P) if and only if g lies in 
> r of the curves of S(P). This follows directly from the definition. If g is 
in G,(P), then it means that g : P,; — Py» bijectively, where P; and P» are 
subsets of P with size r. For each point pi € Pi, we have g € Sp, g(p,); 80 9 
lies in r curves of S(P). The converse direction is similar. So we can redefine 
G,(P) in the following way: 


G,(P) = {glg lies in > r curves of S(P)}. 


Understanding the size of G,(P) is closely analogous to understanding the 
number of r-rich points of a set of lines in R°. In particular, both problems 
involve objects of codimension 2, and they involve the difficulties discussed 
in Sect. 3.3. 

In the future, mathematicians may consider the incidence theory of 
subgroups and cosets inside of a Lie group G by working intrinsically inside 
of G. For the time being, we are much more comfortable in Euclidean space, 
and we choose coordinates on G so that we get a problem about curves 
in Euclidean space. In the particular case of our curves S(P) in our group 
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G, there is a good choice of coordinates where the curves become straight 
lines in R®. In these coordinates, the Elekes—Sharir conjecture reduces to the 
following two theorems about straight lines in R?. The theorems were proven 
in [Guth-Katz1]]. 


Theorem A. Suppose that £ is a set of L lines in R® with < L'/? lines in 
any plane or degree 2 surface. Prove that the number of intersection points 
of lines of & is < L/?. 


In particular, this theorem gives the best known estimate on the many 
intersection problem that we discussed in Sect. 1. 


Theorem B. Suppose that & is a set of L lines in R° with < L‘/? lines in 
any plane. For 3 <r < L'/?, prove that the number of r-rich intersection 
points of & is < L3/?r-?, 


T like to think of this theorem as a generalization of the Szemerédi—Trotter 
theorem to lines in R?. There are probably many generalizations of that 
theorem to higher dimensions. Toth’s conjecture is one generalization, 
and Theorem B is another generalization with a different flavor. 


3.6 Conclusion 


Studying incidence geometry problems in codimenson > 1 presents particular 
challenges. The polynomial method is the most effective tool currently 
available for studying these problems. The simplest case is the case of lines 
in R°. For lines in R°, the joints theorem and Theorems A and B give a good 
picture of what we now understand. The many intersections problem is a 
good example of what we still don’t understand. In higher dimensions, the 
Solymosi—Tao result on the Toth conjecture is the main example of what we 
now know. This result is remarkable (partly) because it works with arbitrarily 
high dimensions and arbitrarily high codimensions. 

Several problems can be transformed into incidence geometry problems 
in higher codimension. We have seen that the distinct distance problem in 
the plane and the partial symmetries of plane sets are both related to the 
incidence structure of lines in R®. 

In the next section we will describe how to attack these problems using 
high-degree polynomials, extending the ideas from the proofs of finite field 
Nikodym and joints. 


4. Combinatorial Structure and Algebraic Structure 
In this section, we will discuss the proofs of Theorems A and B. The proofs 


are based on the polynomial method, and the key point is the connection 
between combinatorial structure and algebraic structure. 
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We saw earlier that lines in R? may have many intersection points 
by clustering into either a plane or a degree 2 surface. Theorem A is a 
(partial) converse to this observation: more than L2+€ intersection points 
may be formed only if the lines cluster into a plane or a degree 2 surface. 
Theorem A says that a certain combinatorial structure forces a certain 
algebraic structure. Our goal in this section is to explore how combinatorial 
structure can force algebraic structure. 

We will see two different mechanisms how combinatorial structure can 
force algebraic structure. We begin by considering what we mean by algebraic 
structure. 


4.1 Algebraic Structure for Finite Sets 


If X C RY”, let deg(X) be the smallest degree of a non-zero polynomial 
that vanishes on X. We have seen that for a finite set X, deg(X) < |X|”. 
Of course particular finite sets can have much lower degree. For instance, any 
subset of a plane has degree 1. For generic sets, the |X|!/" bound is sharp. 
So a generic finite set has deg(X) ~ |X|1/". Any set with degree significantly 
smaller than |X|!/" has non-trivial algebraic structure. 

There is a similar discussion for finite unions of lines. If X is a union 
of L lines in R”, then deg(X) S L*-1, The proof is straightforward, so we 
sketch it here. Suppose that D is a degree so that (D+1)L < dim Poly, (R”). 
Then we can choose a non-zero polynomial of degree < D that vanishes at 
D-+1 points on each of the L lines. By the vanishing lemma, this polynomial 
vanishes on each line. A short calculation shows that we can choose D < 
L=7. In summary, any union of L lines has degree < L*-1, If a union of L 
lines has degree significantly smaller than this, then it has some non-trivial 
algebraic structure. 

With this definition of algebraic structure, we can begin to explore how 
combinatorial structure forces algebraic structure. 


Proposition 4.1 (Degree reduction). Suppose that £ is a set of L lines 
in R°. Suppose that each line contains > A (distinct) intersection points with 
other lines of £. Then the degree of the union of the lines is < 10°L/A. 


(If A < L'/?, then the conclusion of the proposition is worthless, because 
every set of L lines has degree < 4L'/? anyway. But if A is much bigger than 
L‘/?, then the lines have non-trivial algebraic structure.) 

Here is the idea of the proof. We saw above that for any L’ lines of £, 
there is a non-zero polynomial which vanishes on those lines with degree 
< 10(L')'/?. We let £/ C £ be a subset of L’ random lines of £, and we 
consider the polynomial P that vanishes on them. If A and L’ are large 
enough, then this polynomial has to vanish on many other lines. Let / be 
another line of £. If / intersects the lines of £’ at > deg(P) points, then P 
will vanish on | also. The expected number of intersection points between | 
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and the lines of £/ is A(L’/L). Whenever A(L’/L) > 100(L’)'/?, the expected 
number of intersection points is > 10 deg(P). In this situation, the polynomial 
P will vanish on the vast majority of the lines of £. Choosing L’ optimally, we 
get a polynomial of degree < 10°L/A that vanishes on most of the lines of £. 
(And with a little extra technique, we can get a polynomial that vanishes on 
all of the lines of £.) 

I think this proposition is fundamental to the polynomial method. 
It shows that a set of lines with a lot of intersections must have an algebraic 
structure. This algebraic structure is an important clue to try to understand 
such sets of lines. Once we know that the set of lines has a non-trivial 
algebraic structure, it’s natural to try to use algebra and algebraic geometry 
to understand the set better. 


4.2 Ruled Surfaces 


The proof of Theorem A is based on the theory of ruled surfaces. An algebraic 
surface Z(P) C R° is called ruled if each point of Z(P) lies in a line in Z(P). 
If is called doubly ruled if each point of Z(P) lies in two different lines in 
Z(P). There is a classification of doubly ruled surfaces, and in particular the 
following result is relevant for us. 


Proposition 4.2. A doubly ruled algebraic surface Z(P) C R? is a union of 
planes and degree 2 surfaces. 


Theorem A is a discrete analogue of this proposition from algebraic 
geometry. To try to make the analogy as close as possible, we state a small 
variation of Theorem A. 


Theorem A’. Suppose that £ is a set of L lines in R?, and that each line 
contains > 10!°L'/? intersection points with other lines of 2. Then the lines 
of & are contained in a union of 107>L'/? planes and degree 2 surfaces. 


In this analogy, the set of intersection points of the lines of £ is a ‘discrete 
approximation of a surface’. Each of these points lies in two lines of £, and 
each line of £ contains many points of our ‘discrete surface’. The hypothesis 
is that we have a kind of ‘discrete doubly ruled surface’, and the conclusion 
is that £ is contained in a union of planes and degree 2 surfaces. 

The degree reduction argument is a first step to prove Theorem A’. It tells 
us that the lines of £ are contained in the zero set of a polynomial P of degree 
<10->L'/?. This is the right bound for the degree, but we still have to prove 
that the polynomial factors into polynomials of degree 1 and 2. We have to 
understand better the structure of the polynomial P. We will see that the 
combinatorial structure of the lines of £ is connected with the geometric 
structure of Z(P). We explain the connection in the next subsection. 
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4.3 Contagious Structures 


Suppose that / is a line in Z(P). If there are > deg(P) critical points of P on 1, 
then every point of / is critical. The property of being critical is ‘contagious’. 

Let’s give another example of a contagious property. Suppose now that 
1c Z(P) and that each point of | is non-critical. A regular point x in Z(P) 
is called flat if the curvature of Z7(P) vanishes at x—equivalently if there is a 
plane thru x which is tangent to Z(P) to second order. If the line / contains 
more than 3deg(P) flat points, then every point on the line is flat. So being 
flat is also a contagious property. 

These properties are contagious because they are described by (other) 
polynomials. A point is critical if and only if 0,P, 02P, and 03P all vanish. 
These partial derivatives have degree < deg(P)—1. It follows by the vanishing 
lemma that being critical is contagious. With a little more work, being flat is 
also described by polynomials. For any polynomial P, there exists a finite list 
of polynomials SP with degree < 3deg(P), and a (regular) point x € Z(P) 
is flat if and only if SP(«) = 0. It doesn’t take that much work to construct 
SP, and then we see that being flat is contagious too. 

To see how to use contagious properties, we will begin by discussing triple 
intersection points, because the method is a little easier. Suppose that L is 
a set of L lines in R® and each line contains > 10!°L!/? triple intersection 
points. By degree reduction, these lines lie in Z(P) for a polynomial P of 
degree < 1075L1/2. Since the number of triple points on each line is much 
more than the degree, any contagious property of the triple points will spread 
to all of the lines. 

Triple intersection points indeed have interesting properties. If x lies in 
3 lines in Z(P) and the lines are not coplanar, then wx is a critical point of 
P, as we saw in the proof of the joints theorem. On the other hand, if x is 
not a critical point and « lies in three lines of Z(P), then z is a flat point. 
The three lines must lie in the tangent plane of Z(P), and then the tangent 
plane hugs Z(P) along three lines, which forces it to be tangent to Z(P) to 
second order. Anyway, every triple intersection point is either critical or flat. 
Since these properties are contagious, every point in the union of the lines of 
£ must be either critical or flat. 

Contagious properties don’t just spread from points to lines. If there are 
many lines with a contagious property, then it can spread to a whole surface. 
This follows from the following version of Bezout’s theorem. 


Theorem 4.3. If P and Q are polynomials in three variables, and if they 
have no common factor, then Z(P)N Z(Q) contains at most deg(P) - deg(Q) 
lines. In particular, if P is irreducible and Z(P)M Z(Q) contains > deg(P) - 
deg(Q) lines, then Q vanishes on Z(P). 


Remember that our L lines lie in Z(P) where deg(P) < 1075L1/?. Each 
of the lines is either critical or flat. Suppose for a moment that they are all 
flat. (The critical case is similar.) For the sake of exposition, let’s also assume 
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that P is irreducible. The number of flat lines is L > 10'°(deg(P))?. Each 
polynomial of SP vanishes on these lines. The degree of each polynomial of 
SP is < 3deg(P). By the Bezout theorem, SP vanishes on Z(P). This means 
that every point of Z(P) is flat. Then it follows that Z(P) is a plane. 

In general, the polynomial P may be reducible and there may be several 
components, but a similar argument shows that they are all planes. We have 
sketched the proof of the following result, which essentially appears in 
[EKS11]. 


Theorem 4.4 ([EKS11]). Jf £ is a set of L lines in R?, and each line 
contains > 10!°L'/? triple intersection points, then the union of the lines 
is contained in < 10~>L1/? planes. 


This theorem is the case r = 3 of Theorem B. 

It is harder to understand intersection points than triple intersection 
points. The problem is that if x lies in two lines in Z(P), then it doesn’t 
imply that x is either critical or flat. It’s not clear right away if there is 
another contagious property that we can use instead. 

To approach this question, let’s step back and try to understand where 
contagious properties come from. We can build contagious properties by 
looking at polynomials in P and the derivatives of P. If RP is a polynomial 
of degree < C in P and its derivatives, then RP(x) is a polynomial in x of 
degree < Cdeg(P). We can use any such RP in place of SP in the argument 
above. Algebraic geometry helps understand what geometric properties of 
Z(P) at a point x can be described by some polynomial equations in P and 
its derivatives. In short there are a lot of contagious properties. 

We give one more example. A point « € Z(P) is called flecnodal if and 
only if there is a non-zero vector v so that P vanishes in the direction v to third 
order at x. It’s not immediately obvious that being flecnodal is contagious, 
but it is. There is a polynomial F’P, called the flecnode polynomial, of degree 
at most 11 deg(P), and a point « € Z(P) is flecnodal if and only if FP(x) = 0. 
This polynomial and this result were discovered by Salmon in the 1800s. 

Stepping back from the details, we can describe the moral of the proof of 
Theorem A’. If « lies in two lines in Z(P), it leads to some equations about 
P and the derivatives of P at x. These equations are all contagious, and so 
they end up holding at every point of Z(P). So all the points of Z(P) have 
a lot in common with the intersection points. After working out the details, 
it follows that every point of Z(P) lies in two lines in Z(P). The surface 
Z(P) is doubly ruled. By the classification of doubly ruled surfaces, Z(P) is 
a union of planes and degree 2 surfaces. We also know that the degree of P 
is < 1075L/2. Hence all the lines of £ lie in < 10~>L!/? planes and degree 
2 surfaces. 
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4.4 Polynomial Cell Decompositions 


Theorem B involves a combination of all of the difficulties we have encoun- 
tered in this essay so far. It is a problem about lines in R?, so the codimension 
is > 1. This suggests that the proof needs to use high degree polynomials. 
We saw in the last section how to prove the case r = 3 with the polynomial 
method, and I don’t have any idea how to approach the problem without it. 
But for large r, Theorem B is false over finite fields like the Szemerédi—Trotter 
theorem. This suggests that the proof needs to use the topology of R°. 

The proof of Theorem A does not generalize to Theorem B. It breaks 
down in the very first step: the degree reduction argument does not work. 

The proof of Theorem B involves a combination of (almost) all of the 
methods that we’ve discussed in this essay. The key step is to build cell 
decompositions using polynomial surfaces, combining the cutting method and 
the polynomial method. 

Instead of cutting space with D hyperplanes, we cut space with a degree D 
polynomial. A degree D polynomial surface has many good features in 
common with a union of D hyperplanes. The complement of D hyperplanes 
consists of ~ D” components. The complement of a degree D polynomial 
surface consists of S D" components, and there are ~ D” components in 
many examples. We will call these components cells. In each case, a line can 
only enter at most D+ 1 cells. 

The union of D hyperplanes is a special case of a degree D polynomial 
surface, but there are many more polynomial surfaces. Using polynomial 
surfaces gives us much more flexibility, and we have a better chance to 
prove equidistribution. Recall that we would like some equidistribution among 
~ D” cells, which means we are trying to achieve ~ D” conditions. Choosing 
D hyperplanes gives us ~ D degrees of freedom. But choosing a degree D 
polynomial surface gives us ~ D” degrees of freedom. Having so much more 
freedom, it looks more realistic to get equidistribution. Here is a precise result 
about building cell decompositions with polynomial surfaces. 


Lemma 4.5 (Polynomial cell decomposition lemma). If G is any finite 
set in R”, and if D > 1 is any integer, then there is a non-zero polynomial 
P € Polyp(R") so that each component of the complement of Z(P) contains 
< C(n)|G|D~” points of G. 


We should make an important caveat right away. The lemma does not 
say that all the points of G are in the complement of Z7(P). Some or even all 
the points of G could lie in Z(P). 

The proof of the cell decomposition lemma is based on the Stone~Tukey 
ham sandwich theorem, which we discussed in Sect. 2.3. The ham sandwich 
theorem allows us to cut a bunch of sets in half. By using it repeatedly, we 
can cut our set of points into halves, then quarters, then eighths...Here is a 
detailed sketch. 
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1. The ham sandwich theorem says that given N finite volume open sets, we 
can choose a polynomial of degree < N!/” that bisects all of them. 


We are dealing with finite sets, which have volume zero. Suppose that 
we have WN finite sets S1,...,S5.. We let U; be the e-neighborhood of $j. 
We apply the theorem to U; and take the limit as € goes to zero. In this way 
we get the following more combinatorial result. 


2. If S1,..., Sn are finite sets, then there is a polynomial P of degree < N!/” 
so that P > 0 on at most half the points of S; and P < 0 on at most 
half the points of S;. (Remark: P might vanish on some or even all of the 
points of S;.) 

3. We have a set S that we want to divide into 2/7 fairly even pieces. Pick a 
plane that bisects S. Then pick a surface that bisects each half, leaving us 
with four sets of cardinality at most |G|/4. Next pick a surface that bisects 
each of these four sets. Continuing in this way, we have cut 6 into 27 pieces 
of cardinality at most |G|2~7 by a union of J algebraic hypersurfaces. 
The degrees of these hypersurfaces are bounded by step 2, and adding up 
we get a total degree < 27/” as desired. 


Next we discuss how to use the polynomial cell decomposition lemma. 
We consider an arrangement of lines £, and we let G be the set of r-rich 
points. We build a polynomial cell decomposition. If all the points of G lie in 
the cells, then we can proceed by a divide-and-conquer argument as in the 
cutting method. We know that each cell has the same number of points of 
6, and we know the number of lines that enter an average cell. In each cell, 
we can use a more elementary method to count r-rich points. Adding up the 
contributions from all of the cells, we see that the number of r-rich points is 
< L9/?7-?the conclusion of Theorem B. 

This is not a complete proof of Theorem B. It may happen that most or 
all of the points of G lie in Z(P), and then the argument breaks down. Here 
is a slightly more optimistic way of looking at the situation. 

The polynomial cell decomposition argument gives a second, completely 
different mechanism by which combinatorial structure forces algebraic struc- 
ture. If £ is a set of L lines with significantly more than L°/?r7? r-rich 
points, then the argument above shows that almost all of the r-rich points 
lie in Z(P) for a polynomial P of surprisingly low degree. Since there are 
many r-rich points on each line, it follows that the lines lie in Z(P) also, and 
the conclusion is that the degree of £ is far below L!/?. The combinatorial 
structure of having many r-rich points forces algebraic structure. 

Once the set of lines has algebraic structure, the rest of the proof of 
Theorem B is similar to the proof of Theorem A, using contagious properties. 

The polynomial cell decomposition has had several other applications. 
The paper [Solymosi—Taol2] uses it to prove the higher-dimensional general- 
ization of the Szemerédi—Trotter theorem. The paper [KMS12] uses it to give 
new proofs and perspectives on several fundamental theorems of incidence 
geometry. 
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4.5 Final Summary 


The proofs we have been studying get off the ground by proving that 
arrangements with a lot of combinatorial structure must have unexpectedly 
low degree. We have seen two mechanisms to find these unexpectedly low 
degree polynomials. One mechanism is the degree reduction lemma. This 
lemma is proven by combining the parameter counting argument and the 
vanishing lemma. It’s based on the proof of the finite field Nikodym conjecture 
and recovery algorithms for error-correcting codes. The second mechanism 
is the polynomial cell decomposition method. This mechanism is based on 
the polynomial method, but also on the cutting method and surface area 
estimates from differential geometry. 

Once we know that the arrangement we are studying lies in the zero set 
of a polynomial of unexpectedly low degree, then it’s natural to try to use 
that polynomial to study the set. The contagious structures are one tool to 
do that. 
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Summary. We investigate the maximal number S(P,n) of subsets of a set of n 
elements homothetic to a fixed set P. Elekes and Erdés proved that S(P,n) > cn? if 
|P| = 3 or the elements of P are algebraic. For |P| > 4 we show that $(P,n) > cn? 
if and only if every quadruple in P has an algebraic cross ratio. Moreover, there is 
a sequence S,, of numbers such that S(P,n) < S, whenever |P| = 4 and the cross 
ratio of P is transcendental. 
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Let C denote the set of complex numbers. We say that the sets A, B Cc C 
are homothetic, and write A~ B, if B=a-A+b= {ax+b: a © A} with 
suitable a,b € C, 4 0. If P and A are finite subsets of C then let 


s(P, A) 2X CA: X~ PHI, 


where |H| denotes the cardinality of H. In [2] G. Elekes and P. Erdés 
investigated the behaviour of the sequence 


It is easy to see that S(P,n) < 2n(n—1) holds for every finite P andn EN. 
Elekes and Erdés proved that the order of magnitude of S(P,n) is close to 
n? for every P; namely 


S(P, n) > ¢: n2—e log * n 


holds for every n > |P| with positive constants a,b, and c depending on P 
but not on n. They also showed that 
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S(P,n)2e-n® (n> |P|) (1) 


if |P| = 3 or if the elements of P are algebraic, and asked whether or not 
this is true for every finite P. In this paper we answer this question in the 
negative, and characterize the sets satisfying (1). 

Our characterization will be given in terms of the cross ratio and 
projective equivalence. The cross ratio of the distinct complex numbers a, 
b, c, d is defined by 

def C-a d—a 
(a; b;c;d) = mob das 
A map f: A> B(A,B CC) is said to be projective if it preserves the cross 
ratios of the quadruples of A. Two sets are projective equivalent if there 
is a projective bijection between them. Note that P and P’ are projective 
equivalent whenever |P| = |P’| < 3. Our main result is the following. 


Theorem 1. For every finite P the following are equivalent. 


(i) There is a positive constant c such that (1) holds. 
(ii) The cross ratio of every quadruple of P is algebraic. 
(itt) There is a set P’ such that P and P’ are projective equivalent, and the 
elements of P’ are algebraic. 


First we prove the implication (ii) ==> (iii). Suppose that the set P = 
{a1,...,a,} satisfies (ii), where a1,...,a% are distinct complex numbers and 
k > 4 (if k < 3 then the statement is obvious). Let f(x) = (px + q)/(raz +8) 
(x € C), where p,q,7r,s are chosen such that ps — gr # 0, ra; + s # 0 for 
i = 1,...,k, and f(a;) is algebraic for i = 1,2,3. Let P’ = f(P). Since 
f is projective, P and P’ are projective equivalent. For the same reason, 
the cross ratio of the numbers f(a1), f(a2), f(a3), f (ai) is algebraic for every 
4 <i<k, Since the first three of these numbers are algebraic, so is f(a;). 
Thus the elements of P’ are algebraic, and hence P satisfies (iii). 

The implication (iii) => (i) is an immediate consequence of the following 
theorem and of the result of Elekes and Erdés stating that (1) holds if the 
elements of P are algebraic. 


Theorem 2. Let |P| = |P’| =k, and suppose that P and P’ are projective 
equivalent. Then there are positive constants c, and cg depending only on k 
such, that 


a+ SP) = SP") <co+ S(P, 0) (2) 
for every n> k. 


Lemma 1. For every Q C C, |Q| = k andn > k we have S(Q,kn) < 
ESO). 


Proof. Let E C C be such that |E| = kn and s(Q,E) = S(Q,kn). Let N 
denote the number of pairs (C, D) such that CC DC E,C ~ Qand|D| =n. 
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Clearly, N = S(Q, ol ers On the other hand, as each D contains at most. 


n—- 


S(Q,n) C’s, we have N < (*") 5(Q, n). This gives 


n 


S(Q,n) — CS) ~  a(n—1)...(n—k +1) 


S(Q,kn) _ (*")  kn(kn —1)...(kn —k +1) eae 


Proof of Theorem 2. The statement of the theorem is obvious if k = 1 or 
k = 2, so that we may assume k > 3. Let a,b,c be distinct elements of P, 
let f be a projective bijection from P onto P’, and let f(a) =a’, f(b) =U’, 
f(c) = Cc. We can suppose that a = a’ = 0 and b = b’ = 1, since otherwise we 
replace P by (P —a)/(b—a) and P’ by (P’ —a’)/(b' — a’). This replacement 
will not affect the projective equivalence of the sets P and P’, nor will it 
change the values of S(P,n) and S(P’,n). 

Let A C C be such that |A| = n and s(P,A) = S(P,n). If X C A 
and X ~ P then there are elements x,y € X such that X = {a + p(y — 
x) : p € P}. Therefore, if T denotes the set of all pairs (x,y) € Ax A 
satisfying x + p(y — x) € A for all p € P, then we have |T7'| > S(P,n). Let 
\ = c(c' — 1)- (e’(e—1))~, and let jz be a number to be fixed later. We put 


B={at+p'Qy+p—x):(2,y) €T,p ¢ Ph. 


If A Ay + p, then the set Uz 4 = {e+ p'(Ayt+p— x): p' © P’} is similar 
to P’. We can select the number jz in such a way that « 4 Ay + p for every 
(x,y) € Ax A, and the sets Uz, ((x,y) € Ax A) are distinct. Fixing such a 
ps it follows that 


s(P’, B) > |T| > S(P,n). (3) 
For every p € P we denote p’ = f(p) 


Ap = {a+p(y—a):(2,y)€T}, Bp ={a+p'Ay+u—a) : (x,y) € TH. 
and 
bp(a@+p(y—2))=a+p'(Ay+u—a2) ((2,y) €T). 


First we show that ¢, is a well-defined map from A, onto B,. To this end we 
have to prove that if (a, y) € T, (u,v) € T and 


z+ply—z)=u+plv—4), (4) 
then 


zgtp'(Aytu—2)=utp (Avt+p—U). (5) 


If p= p! =0 or p=p’ = 1 then the implication (4) => (5) is clear. If p =c, 
p’ = c, then (4) implies (1 — c)a + cy = (1 —c)u+ cv. Therefore, by the 
definition of A, 
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1-c 
l-c 


a+cQy+u-2)=(l-c)et+rAc*yy teu = (l-o)@+cy)+cp= 


1-c 
l-c 


((1—c)ut+ev)+dw=(1—-c)ut+rAcdv+cp=uted(vt+pu-—), 


which gives (5). Finally suppose p € P \ {0,1,c}. Since f is projective, we 
have (0;1;¢;p) = (0;1;¢;p’) and thus \ = p(p’ — 1) - (p'(p — 1))~1. Then 
a computation identical to the one above shows that (4) implies (5) in this 
case as well. This proves that the map yp is well-defined, and it is clear that 
bp(Ap) = Bp. 

It follows from the definition of T that A, C A, and hence |A,| < n. 
Thus we have |B,| <n, and then |B] < >),<p|Bp| < kn. Combining with 
(3), this gives S(P’, kn) > S(P,n). Then, by Lemma 1, we obtain S(P’,n) > 
k-?*§(P,n). Interchanging the roles of P and P’ we get the other inequality 


of (2). 


In order to prove the implication (i) ==> (ii) of Theorem 1, we may assume 
that |P| = 4. Indeed, if (i) holds for P then it holds for every four-element 
subset of P as well, and if (ii) holds for every four-element subset of P, then 
it also holds for P. 

Let a be a transcendental number, and denote 


Sr, = S$ ({0,1,2,a},n) (n> 4). 


The value of S, does not depend on the choice of a. Indeed, if 6 is another 
transcendental number, then there is a field-automorphism o of C such that 
o(a) = B. For every X C C, |X| = 4 we have X ~ {0,1,2,a} if and 
only if o(X) ~ {0,1,2, 6} and this easily implies that S({0,1,2,a},n) = 
S({0, 1,2, 6}, n) for every n > 4. 


Theorem 3. There are positive absolute constants cy and cg such that 
for every quadruple P Cc C, if the cross ratio of the elements of P is 
transcendental, then 


C1° Sn < S(P,n) < cg: Sn 
for every n> 4. 


Proof. Let P = {a, b,c, d} and (a; b; c;d) = a. We put 8 = 2/(2—a) and P’ = 
{0, 1,2, 6}. Since (0; 1; 2; 6) = a, the sets P and P’ are projective equivalent. 
Consequently, by Theorem 2, there are absolute constants c,,c2 > 0 such 
that cy - S(P’,n) < S(P,n) < c+ S(P’,n) for every n > 4. Also, since 
G is transcendental, we have S(P’,n) = S, by the remark preceding the 
theorem. 


Now, in order to prove the implication (i) => (ii) of Theorem 1, it is 
enough to show that S;, = o(n”) as n + oo. Indeed, suppose this is true, and 
let P C C be a four-element set for which (i) holds but (ii) does not. Then, 
by Theorem 3, c2- Sn > S(P,n) > c-n? for every n > 4, a contradiction. 
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The rest of the paper will be devoted to the proof of S;, = o(n”). In the 
sequel we denote 


S(n, ©) = $({0,1,2,¢h.n) (CE C\ {0,1,2},n EN), 
Lemma 2. For every n we have S(n,c) > Sy for all, but a finite number of 


cEC. 


Proof. Let a be transcendental and let A = {a1,...,a@n} C C be such that 
s({0,1,2,a},A) = S,. This means that there is a set I of quadruples of 
indices such that |I| = S,, and for every (i,j,k,m) € I we have 


a; —2a;+a,=0 and (a—1)a;j—aaj;+am =0. (6) 
Let 


n 


V={ Donan Qa), spot), 


i=1 


then V is a linear space over the field Q(a). Let B = {bi,...,ba} be a basis 
of V, and let 


d 
= 1G ob; (i =1,...,n) (7) 
j=l 
be representations with r;; € Q(a) for every i = 1,...,n and j = 1,...,d. 


Since the coefficients of a;, aj, @%,@m in the equations (6) belong to Q(a), and 
bi,...,6¢ are linearly independent over Q(q), it follows that substituting the 
representations (7) into the equations (6) we obtain identities. In other words, 


for every choice of the variables x1,...,%q¢, the numbers 
d 
a= > ry; (i =1,...,n) (8) 
j=l 


will satisfy the equations 
ci — 2c; +cx, = 0 and (a — 1)¢; — ac; +em =0 (9) 


for every (1,7,k,m) € I. The right-hand sides of (8), as linear forms, are 
different, because for 7; = b; they have different values (namely aj,...,@n). 
Then 21,...,2q can be chosen to be integers such that the values of the 
corresponding c,,...,Cy, are different. Indeed, for every i; 4 72, the set 
{(a1,...,%a) : Ci, = Cin} is a hyperplane, and Z@ cannot be covered by 
finitely many hyperplanes. Clearly, if the x;’s are integers then c; € Q(q) for 
every 7 =1,...,7. 

We have proved that there are distinct elements c; € Q(a) satisfying the 
equations (9). For every i = 1,...,n there is a rational function R; with 
rational coefficients such that c; = R;(a). Thus we have 


Ri(a) — 2R;(a)+ Re(a)=0 and (a—1)Ri(a) —aR;j(a) + Rm(a) =0 
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for every (i,j,k,m) € I. Since a is transcendental, the rational functions 
R;(t)—2R,(t)+R,(t) and (t—1)R,(t)—tR,(t)+Rm(t) must be identically zero 
for every (i,j,k,m) € I. Therefore, whenever the numbers Ri(c),...,Rr(c) 
are defined (that is, c is not a root of the denominator of R,-...-R,), and are 
distinct, then the set A. = {Ri(c),...,Rn(c)} contains S, subsets similar to 
{0,1,2,c}. The rational functions R; are distinct, since they have different 
values at a. Then |A,| =n for all but a finite number of c’s, and for such a 
c we have S(n,c) > S({0,1,2,c}, Ac) > Sn. 


We remark that, in fact, S(n,c) = Sn holds for all but finitely many 
c € C. As we shall not need this result, we omit the proof. What we need is 
the fact that S;, < S(n,k) for every k € N, k > ko(n). This implies 


Sn. . S(n,k 
lim sup — < limsup [ lim sup el) ; (10) 
noo n? noo k—-00 n? 


and hence, in order to prove S;, = o(n?), it is enough to show that the right- 
hand side of (10) is zero. In the next theorem we prove somewhat more. 


Theorem 4. 


lim 28) =0. 
n—-oo n2 
k—-o0o 


Let 2,q,---,da € Z and X1,...,XqaEN. We shall say that the set 


d 
Re d1y 0 44s Kaye Xa) = {24} eas te € Z,0 <x; <X(i=1,..0)} 


i=1 


is a d-dimensional arithmetical progression of size i ay X;. We shall need 
the following theorem proved by G. A. Freiman in [38-5] and I. Z. Ruzsa in 


[6]. 
Theorem 5. If AC Z,|A| =n, and |A+ Al < cn, then A is contained in a 


d-dimensional arithmetical progression of size not exceeding c'n, where d and 
c’ only depend on c. 


Let A C Z be a finite set, let G = (A, FE) be a graph, and put S = {a+b: 
(a,b) € E}. The following result was proved by A. Balog and E. Szemerédi 
in [1]. 


Theorem 6. If |A| =n, |E| > cin? and |S| < can, then there is a subset 
A’ CA such that |A’| > cgn and |A’+A'| < can, where c3 and c4 only depend 
on cy and cg. 


As Balog and Szemerédi remark, these two theorems can be combined to 
obtain the following result: if |A] =n, |E| > cin? and |S| < con, then there 
is a d-dimensional arithmetical progression R of size not exceeding c5n such 
that |RM A] > cen, where d, cs, cg depend only on c; and cs. In the next 
lemma we prove a slight improvement of this result. 
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Lemma 3. /f |A| = n, |E| > cin? and |S| < can, then there is a d- 
dimensional arithmetical progression R of size not exceeding ci,n such that 
the subgraph of G induced by the set RN.A contains at least cj2n? edges. Here 
the constants d, c1, and cig depend only on cy and co. 


Proof. In the sequel c7, cg, ... will denote constants depending only on c; and 
Cg. Since |E| > c,n?, there is a subset A, C A such that in the subgraph of 
G induced by A,, the degree of every point is greater than cyn/2. (Indeed, 
delete one by one each point of degree at most cyn/2. In this way we cannot 
remove all points of A, and the set of remaining points will have the required 
property.) Then |A;| > cyn/2. Applying Theorem 6 to the graph induced by 
Aj, we obtain a subset A’ C Ay such that |A’| > c7n and |A’ + A’| < egn. 
Then we apply Theorem 5 to obtain a d-dimensional arithmetical progression 
R= R(z,q,---,4a;X1,---; Xa) such that A’ C R and the size of R does not 
exceed cgn. 

Let cio = cic7/8 and let B be the set of those points of A that are 
connected to at least cign points of A’. Since there are at least (c7n - 
cyn/2)/2 = 2cion? edges starting from the points of A’, we have |B| > cion. 
Let R = {R+b:b€ B}. Since A’ C R, each set R+b (b € B) contains 
at least cj9n elements of the form a+ 6, where a € A’ and (a,b) € E. 
These elements belong to S and, by assumption, || < con. This implies 
that any pairwise disjoint subsystem of R contains at most co/cio sets. Let 
{R+b;:i=1,...,k} be a maximal disjoint subsystem of R. Then k < c2/cio 
and for every b € B there is an i < k such that (R+6)N(R+0;) £9. This 
implies b € (R — R) + b;, and hence B C U*_,[(R— R) + bi]. Let bey = z 
and H = Uf*!|(R— R) +0]. Then A'UB C Has A’ C RC (R—R) + p41. 
Thus the subgraph of G induced by HM A contains at least cj2n? edges 
(cig = cf) /2), since every point of B is connected to at least cion points of 
A’. Let qa4j; = 6; for j =1,...,4 +1, and put 


dt+k41 
R={ ‘ Vig i —-Xi <a < Xi(1<i<d) and 00 < Ad <isdtk+n)}, 
i=l 


Then H Cc R’, and in order to complete the proof of the lemma it is enough 
to note that R’ is a d+ k+1-dimensional arithmetical progression of size not 
exceeding 2*+1[]%_, (2X;) < 24 *+1egn = cyn. 


Lemma 4. Let n,k € N,c> 0, and suppose that S(n,k) > en?. Then there 
are positive constants d, c', c’ depending only on c, and there exists a d- 
dimensional arithmetical progression R = R(z,q1,---,da;X1,---;Xa) such 
that the gcd(q,.--,da) = 1, the size of R is at most c'n, and for a suitable 
integer u the set 


{(a,b)€ Rx R:2a+k(b-—a) =u} (11) 


contains at least cn distinct pairs. 
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Proof. Since S(n,k) > en?, there is a set A C C such that |A| = n and A 
contains more than cn? subsets similar to {0, 1, 2,k}. Repeating the argument 
of the proof of Lemma 2 with Q instead of Q(a), we can see that A can be 
chosen to be a subset of Q. Multiplying by a suitable integer, we may assume 
that AC Z. 

Let E denote the set of all pairs (a,b) such that each number a, b, (a + 
b)/2 and a + £(b — a) belongs to A; then |E| > cn?. Let S = {a+b: 
(a,b) € E}, then S C 2A, and hence || < n. Therefore, by Lemma 3, there 
are positive constants d, c’, c’” depending only on c and there exists a d- 
dimensional arithmetical progression R = R(z,q1,.--,qa; X1,.--,;Xa) of size 
not exceeding c/n such that AN R contains at least c’n? edges from E. Let 
F = {(a,b) € EF: a,b € R}. Then |F| > c’n? and |{2a + k(b — a) : (a,b) € 
F}| < |2- A] = n. Consequently, there exists an element u € A such that 
the set defined in (11) contains at least c’n distinct pairs. If (q1,...,qa) = 1, 
then the proof is complete. Otherwise let (q1,...,¢a) = m, and replace R 
by R(z,q/m,...,ga/m;X1,..., Xa). It is clear that this modified R satisfies 
the requirements. 


Lemma 5. Let R = R(z,q,.--,qa;X1,---;Xa) be a d-dimensional arith- 
metical progression with (q1,.--,da) = 1, and denote s = tee X; and 
M = mini<i<a Xi. Then for every positive integer k, the number of elements 
of R in any residue class mod k is at most (s/M) + (s/k). 


Proof. For i=1,...,d define 


(k, q1,--+5Gi-1) k= k 
(Ry Giger 5.) ka 


We have k;...kq = k. Now consider the k numbers 


ki = 


yig +..-+yaqa,0< yi < ki — 1. 
We show that they are all incongruent modulo k. Indeed, suppose that 
yg t.--+ Yada =yiat.---+y4da (mod k). 
Let j be the largest subscript for which y; 4 y;. Since the first 7-1 summands 
are divisible by (q1,...,q;-1), we have 
(k,q1,---5 45-1) |Q5 (Uy — 94) 
hence 
(kK Q1,++-, qj-1) 
(k, q1, oe 24g) 


which contradicts the assumption 0 < yj, yj < kj —1,yj Ay}. 
Let R* be the set of those elements of R that are =a (mod k). The sets 


kj = 


ty =, 


R*+yigat+...+yada, O<y%<k-1 
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lie in different residue classes modulo k, hence they are disjoint, and they are 
contained in R(z,q1,---,da,X1 + ki —1,...,Xa+kq — 1), consequently 
kIR*| <] [+e -1) 
(we recall that k =[]&;), or 
s ky - k; —1 
Rl< 14 F 
els gID(+ Ae") sg11 (+4) 


To complete the proof we show that the inequality 


d 
ki — (14) -1 
1 <1 
ie ae 
holds for arbitrary real numbers k; > 1,M > 1. For d = 2 this inequality 


asserts that 
ky -—1 kg —1 kiko —1 
1 <14+ ———_. 
(+9) +S) s+ 


After multiplying by M and rearranging this becomes 
(ki — 1) (ke — 1) 
M 


which is true if all the variables are > 1. The case for d > 3 now easily follows 
by an induction on d. 


< (ki — 1)(ke — 1), 


Now we turn to the proof of Theorem 4. Suppose that limsup,, ;_,. 
S(n,k)/n? > 0. Then there is a constant : > 0 such that for every K there 
are integers n,k > K with S(n,k) > cn?. Thus, by Lemma 4, there are 
positive constants d, c’, c’ such that the followine statement holds: 


(x) for every & there is a d-dimensional arithmetical progression 
Rea eG be Re eg 


and there are integers n,k > K such that (qf,..., qi) = 1, the size of 
Rx is at most c’n, and for a suitable integer u the set 


{(a,b) € Rk x Re : 2a+k(b—a) = u} 
contains at least c’n distinct pairs. 


We prove that this is impossible. Let d be the smallest positive integer 
such that (*) holds for suitable positive constants c’ and c”. If 2a+k(b—a) = u 
then 2a = u (mod k) and hence, if Rx, n, k are as in (*) then 2: Rx contains 
at least cn elements in one of the residue classes mod k. This implies that 
Rx contains at least c’'n/2 elements in one of the residue classes mod k. Let 
Mx = mini<i<a he? Then, by Lemma 5, we have 


c"/2< (¢/Mx) + (¢/k). (12) 
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Let C = 4c'/c’. If K > C, then k > K implies c’/k < c’/4 and thus (12) 
gives Mx < C. By rearranging the indices we may assume that X < C 
holds for every K > C. This implies d > 1. Indeed, if d = 1 then we have 
|Rx| =|X#| < C for K > C. This gives |Rx x Rx| < C?, which contradicts 
(*) for K > C?/c". 

We complete the proof by showing that («) also holds for d — 1 instead 
of d, if we replace c” by &”/C?. Since d was minimal, this will provide the 
contradiction we were looking for. 

For every K we put Ri, = R(z*,q*,... a XE, ...,X ,). Also, for 
every a € Re we choose a ae a= 2 + + iat aig with 0 < 
a, < X;(i=1,...,d) and define a’ = z* + mt ") igi. In this way we have 
defined a map a+> a’ from Rx into Ry. 

Let kK > C, and let Rx, n, k and wu be as in (*). Obviously, a—a’ € 
{i- gi :0<i< a —1} for every a € Rx and hence the set 


P = {2a'+k(b' —a’): (a,b) € Re x Rx, 2a+k(b—a) =u} 
contains at most (X/‘)? distinct elements. Since K > C, we have X/§ < C 
and thus |P| < C?. This implies that for a suitable wu’ the set 

{(a', 0) € Ra x Rie 2a’ +k(b' — a’) = u'} 


contains at least (c’/C7)n distinct pairs. Therefore, replacing c’” by c’/C?, 
the d — 1-dimensional arithmetical progression R, and the integers n, 
k, u’ will satisfy the statement of (*) apart from the condition that 


(qk,...,q,) = 1. But this condition can also be fulfilled if we replace 
Rig by 
R(z* gt /m,...,dg-1/m Xf, ..., X44); 
where m = (qi,...,q*_1). 
References 


1. A. Balog and E. Szemerédi, A statistical theorem of set addition, Combinatorica 
4 (1994), 263-268. 

2. G. Elekes and P. Erdés, Similar configurations and pseudogrids, preprint. 

3. G. A. Freiman, Foundations of a structural theory of set addition (in Russian), 
Kazan Gos. Ped. Inst., Kazan, 1966. 

4. G. A. Freiman, Foundations of a structural theory of set addition, Translation of 
Mathematical Monographs vol. 37, Amer. Math. Soc., Providence, R. I., USA, 
1973. 

5. G. A. Freiman, What is the structure of K if kK + K is small?, Lecture Notes in 
Mathematics 1240, Springer, Berlin-New York 1987, pp. 109-134. 

6. I. Z. Ruzsa, Generalized arithmetical progressions and sumsets, Acta Math. Sci. 
Hung. 65 (1994), 379-388. 


* 


On Lipschitz Mappings Onto a Square 


Jitt Matousek** 
J. MatouSek (2) 
Department of Applied Mathematics, Institute of Theoretical 
Computer Science (ITI), Charles University, Malostranské ném. 25, 


118 00 Praha 1, Prague, Czech Republic 
e-mail: matousek@kam.mff.cuni.cz 


1. Introduction 


The following problem was posed by Laczkovich [5]: Let E C RR¢ (d > 2) be 
a set with positive Lebesgue measure \“(E) > 0. Does there exist a Lipschitz 
mapping f : R¢ > Q = [0,1]¢, such that f(E£) = Q? Preiss [6] answered this 
question affirmatively for d = 2: 


Theorem 1. Let E C R? be a set with \7(E) > 0. There exists a Lipschitz 
mapping f of the plane onto the square Q = [0,1]?, such that f(E) = Q. 


Later it was observed by Jones that this theorem is also an easy 
consequence of a much earlier result of Uy [8]. 

In this note we give a somewhat different proof of this theorem based on a 
well-known combinatorial lemma due to Erdos and Szekerés. By an additional 
trick, we also prove the following “absolute constant” version: 


Theorem 2. There exists a constant c > 0 such that for any E in the plane 
with Lebesgue measure \7(E) = 1 there exists a 1-Lipschitz mapping f of the 
plane onto the square Q = (0,c]?, such that f(E) = Q. 


The value of c obtained from our proof is quite small. Clearly some 
improvement is possible, but it seems that our method is not suitable for 
obtaining the best possible value of c. It is easy to see that one cannot hope 
to push c arbitrarily close to 1 (e.g., consider EF being a disk of unit area; its 
diameter 2/,/7 is smaller than the diagonal of the unit square, and so there 
is no 1-Lipschitz mapping onto the unit square). 


“This paper was originally published in 1997. The present version is a minor revision 
from 2012, which includes some references to newer developments. 
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Laczkovich’s question for d > 2 remains open; a combinatorial problem 
related to an attempt to generalize our proof is stated at the end of Sect. 2. 


2. Proof of Theorem 1 


The metric in R?, implicitly considered in Theorem 1, is the usual Euclidean 
metric. It appears more convenient to work with the maximum metric do 
defined by 


doo((X1,Y1), (2, y2)) = max(|r1 — x2], |y1 — yel)- 


This metric will be used throughout the rest of this paper. Clearly this 
modification does not affect the validity of Theorem 1. 

We begin the proof by a simple lemma, which is essentially contained 
in [6]. Let a > 0, w > 0 be real numbers and y : [0, a] — [0, a] be a 1-Lipschitz 
real function. Let Q denote the square [0,a]?. We partition Q into three 
regions defined as follows (see Fig. 1): LZ (resp. S, resp. U) is the set of points 
(x,y) € Q with y < p(x) — w (resp. v(x) —w < y < v(x) + wu, resp. 
y > v(x) + w). We define a mapping f = fy.w : Q > [0,1] x [0,1 — 2u] 


U 
Lo A 
f(U) 


Fig. 1 Contraction along y 


(called the contraction of Q along y by w) as follows: 


(x, min(y, 1 — 2w)) for (a, y) € L 
f(x,y) = ¢ (x, max(y — 2w,0)) for (x,y) EU 
(a, max(min(y(x) — w, 1 — 2w),0)) for (a, y) € S. 


Lemma 3. With a,w,y as above, f = fw is a 1-Lipschitz mapping. 


Proof: Let p, = (a1,y1) and po = (22,y2) be two points in Q and 
gq = (u1,01) and gg = (ue2,v2) their f-images, resp., and suppose that 
v1 > vg. We must show doo (p1, p2) > doo (1, G2). First, if |uy — ua] > |v — vol, 
then doo(q1, q2) = |ui — val = |a1 — va] < doo (pi,p2), as f does not alter the 
x-coordinate. On the other hand, suppose that |uy—ug| < |vi —ve| = v1 — v9; 
then q, € f(S)Uf(L) implies gz € f(L), as f(S) is a graph of the 1-Lipschitz 
function « ++ max(0,min(1 — 2w,y(x) — w)). Therefore f decreases the 
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y-coordinate of p; by at least as much as the y-coordinate of po, and we 
have doo(41, 42) = U1 — V2 < Y1 — Y2 < doo(P1, Pe). 

We also note that the mapping f in the above lemma does not increase 
the measure of sets. 

Let E C R? be asin Theorem 1. By the Density Theorem for the Lebesgue 
measure, one can choose a square Qo with side ay and a compact subset 
K C EN Q with A?(K) > 0.99A?(Qo). For notational convenience, let us 
assume Qo = [0,a0]”. Let fo : R? > Qo be a 1-Lipschitz retraction of the 
plane onto Qo. 

We derive Theorem 1 from the following lemma. 


Lemma 4. Let Q = [0,a]? be an azis-parallel square, let K C Q be compact 
with \7(K) = 47(Q)(1 —«), 0 < e < 0.01. Then one can find a 1-Lipschitz 
mapping g of Q onto Q' = [0,a’|? such that 


(i) a’ > a(1— ve), 
(ii) doo (p, g(p)) < ave for any p€ Q, 
(iii) \2(K’) > 42(Q’)(1 — 0.92), where K! = g(K). 


Assuming this lemma, the proof of Theorem 1 is finished by an inductive 
construction as follows. Suppose that we have already constructed a square 
Qi; = [0,a;]? and a 1-Lipschitz mapping f; of R? onto Q; in such a way that 
d?(Ki) > A?(Q;)(1—-€:), where K; = f;(K). We apply Lemma 4 with Q = Qi, 
a =aj;, K = Kj, and we get a 1-Lipschitz mapping g; : Q; > Qi+1 = (0, ai41]? 
with properties as in the lemma. We set fj+1 = g; 0 fj and continue by the 
next step of the construction. 

From the Lemma we get €:41 < 0.9€; and aj41 > aj(1 — Vé;). We 
thus have ¢; < 0.01(0.9)’, a; > ao lems — 0.95//10), and straightforward 
estimates give a = lim;_,., aj > 0.1lag. 

For every point p € R?, we set f(p) = limi+x fi(p). Condition 
(ii) of Lemma 4 guarantees that the limit exists. The mapping f is clearly 
1-Lipschitz, and its image is contained in the square Q = [0,a]?. The set 
f(K) is compact and it is easily seen that it is also dense in Q, and thus 
f(K) = Q. The proof of Theorem 1 is finished by rescaling f so that it maps 
K onto [0, 1}?. 


Proof of Lemma 4: Let G be the set of squares with side a/n from an 
n X n grid covering Q@. Let Bp C G be the set of squares s € G with 
\*(s\ K) > 2A?(s). Choose n so large that the squares of Bo contain at 
least 1/2 of the measure of Q \ K; this is possible by elementary measure 
theoretic considerations. Then |Bo| > en?/2. 

The required mapping g will again be constructed inductively, this time 
in a finite number t of steps. Let us explain the first step of the construction. 
Let yo : [0,a] — [0,a] be a suitable 1-Lipschitz function; its choice is the 
heart of the proof. The requirement on Yo is that either the graph of yo 
(i.e. the set {(a, yo(x)); x € [0, a]}) or the graph of yo rotated by 7/2 (i.e. the 
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set {(yo(y),y); y € [0,a]}) contain the centers of possibly many squares of 
Bo, namely at in ,/|Bo| such centers. We will explain the construction 
assuming that the first case occurs (the graph of yo contains at least \/|Bo| 
centers). The second case is handled symmetrically, by exchanging the role 
of the coordinate axes. 

Let Do © Bo be the squares whose centers are contained in the graph 
of yo, |Do| > VW |Bo|. We let Go : Q + Q be the contraction of Q along Yo 
by a/n (see the definition above Lemma 3). We define an auxiliary mapping 
ho : [0,a] x [0, a(n — 2)/n] > [0,a(n — 2)/n]?, acting as the identity on the 
first n — 2 columns of the grid and contracting the last two columns to a 
vertical segment, and we set go = ho ° go. Thus, the range of go is the square 
Qi = [0, a(n — 2)/n]?. 

It is easily checked that the mapping go maps each square of the grid G 
into some square of G in Q,. All squares in Do are contracted to pieces of 
Lipschitz curves. 

We define B, = {s € G; As’ € Bo : go(s’) C s and \?(go(s’)) > 0}. Thus, 
we have |B,| < |Bo| — |Do|, so at least |Do| squares of Bo are “killed” by go. 

Similarly we define mappings g1, g2,.... Having defined the mapping g;-1, 
we put B; = {s € G; ds’ € By_1 : gi_i(s’) C s and \?(g;_1(s’)) > O} and 
we construct an appropriate 1-Lipschitz mapping g; : Q; — Qj+1, where 
Qi = (0, a(n — 21) /n]?. The mapping g; kills at least \/|B;| squares of B;, in 
the sense that |B;| —|Biii| > W/|Bil. 

Let ¢ be the first index such that |Bo| — |B;| > en?/5. For i < t, we 
thus have |Bo| — |B;| < en?/5 (and therefore |B;| > en?/4). In particular, we 
get en?/5 > |Bo| — |B] > i=) VIBil > (t — 1)nVe/2, which implies that 
t< 2n/eE+1< nVe/2 (since we may assume that nvé is large). 

By composing the mappings go, .-., 9:1, we obtain a 1-Lipschitz mapping 
g of Q onto Q’ = [0,a’']?, with a’ = a(1 — 2t/n) > a(1 — Ve). Under this 
mapping, at least en?/5 of the squares of By have images of measure 0, and 
these squares contain at least #2 (a?/n?)(en?/5) = ca? of the measure of 
Q\ K. Hence *(g(Q\ K)) < Bea? and \2(g(K)) = 22(Q' \ g(Q\ K)) > 
a’? — ca. Using the assumption € < 0.01, we have a’ > a(1 — Ve) > 0.9a, 
and thus \?(g(K)) > a(1- 2. Be) > al2(1 — 0.9e) as required. 

It remains to show how to chop the mappings y;, whose graphs or 
rotated graphs pass through sufficiently many center points of the squares 
of B;. Call a set D C R? 1-Lipschitz in the y-coordinate (resp. in the 
x-coordinate) if jy; — y2| < |x1 — x2] (resp. |a1 — ra] < ly: — yol) for any 
two points (11, y1),(v2,y2) € D. It is easy to see that if D is a finite set 
which is 1-Lipschitz in the y-coordinate, then there is a 1-Lipschitz function 
whose graph contains D. Hence it suffices to establish the following 


Lemma 5. Let P be an m point set in the plane, k = [./m]. Then there 
exists either ak point subset of P 1-Lipschitz in the x-coordinate or ak point 
subset of P 1-Lipschitz in the y-coordinate. 
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Proof: Call a set C in the plane a nondecreasing chain if for any two points 
(21,41), (2, y2) € C with x, < xq we have y; < yo; a nonincreasing chain 
is defined analogously with an opposite inequality for the y-coordinates. 
A lemma due to Erdés and Szekerés [4] can be stated as follows: Any 
m point set in the plane contains a k& point nondecreasing chain or a 
k; point nonincreasing chain. To see the relevance to the above lemma, rotate 
the coordinate system by 7/4; then nondecreasing chains become precisely 
1-Lipschitz subsets in the x-coordinate and nonincreasing chains become 
1-Lipschitz subsets in the x-coordinate. 

For a direct application of the above method for Laczkovich’s problem in 
dimension d, one would need the following: Given an m-point set P in R, 
find a subset S of size em!~!/4 (c > 0 a constant) which is 1-Lipschitz in the 
x;-coordinate for some 7 = 1,2,...,d. Here a subset S C P is C-Lipschitz in 
the 2;-coordinate if for any two points a = (a1,...,@a),0 = (b1,...,ba) € S$ 
one has |a; — b;| < C max;z;(|a; — 0;|). As noted by Gabor Tardos, already 
for d = 3 such a large 1-Lipschitz subset need not exist; see [7]. Indeed, let 
PCR? be the set {(i,j,i +7); i,7 =1,2,..../m}; it is not difficult to check 
that there is no subset of size exceeding O(,/m) which is 1-Lipschitz in one 
of the coordinates. Still, it is possible that there always exists a subset of size 
cm!—1/4 which is C-Lipschitz in one of the coordinates, C' a constant. This 
problem looks interesting in its own right, although it is not completely clear 
whether a positive answer would solve Laczkovich’s problem. 


Update from the year 2012. Alberti, Csérnyei, and Preiss gave a negative 
solution to the last stated problem (apparently still unpublished; see [1]). 
However, the following variant of the problem, from [2], still remains open, 
and a positive answer would also have interesting analytic consequences: Is 
there a constant C such that for every n¢-point set in IR? there exists a 
coordinate system and a subset of at least n?—! points that is C-Lipschitz in 
one of the coordinates? Further significant progress regarding Laczkovich’s 
problem and related questions was announced by Csérnyei and Jones (see [2]). 


3. Proof of Theorem 2 


In the above proof of Theorem 1, the only part where an arbitrarily large 
part of the measure of F is wasted is the initial application of the Density 
Theorem for the choice of the square Qo with \7(Qo  E) > 0.99A?(Qo). In 
the rest of the proof, Qo M E is then mapped onto a square with side at most 
10 times smaller than the side of Qo. Thus, it suffices to prove the following 
lemma: 


Lemma 6. For any <9 € (0,1) and any E C R? with \7(E) = 1, there exists 
a 1-Lipschitz mapping f : R? + Q = [0,c]? with \7(f(E)) > (1 — €0)A?(Q), 
where c = c(é9) > 0 only depends on éo. 
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Proof: The proof is probabilistic, using the so-called Second Moment 
Method; see, e.g., [3]. 

Fix c = \/éo/7. For simplicity, we will construct a 2-Lipschitz mapping 
onto [0,c]?; the Lemma is then obtained by taking c/2 instead of c and 
rescaling. Choose an integer N large enough so that some axis-parallel square 
S' with side 3cN contains at least 9/10 of the measure of E. Let G be an Nx N 
grid consisting of 3c x 3c axis-parallel squares; the squares of G are denoted 
by sjj,7=1,2,....N denoting the row index and j = 1,2,...,.N the column 
index. Let Mj; denote the middle c x c square in s;;, and set M = U4 M;;. 
Since 9 translational copies of the set M cover the square S, it is possible to 
place the grid G in such a way that A?(M 1m E) > 1/10; we assume that such 
a placement has been fixed. 


ls 


Si 
A 
Ss 


Fig. 2 (a) The grid G and the set M; (b) the mapping g- 


Consider a grid square s;;. Let z = z;; € [0, 2c]? be a vector. We define a 
mapping g, : 84; —> $4; as follows. We require that g, is the identity map on 
the boundary 0s;,; of s;;, and on M;;, gz acts as the translation by the vector 
z —(c,c) (see Fig.2). With respect to the d,, metric, g, is a 2-Lipschitz 
mapping on 0s,; U M;;. We extend it to a 2-Lipschitz mapping s;; > s;; 
(this is possible since for any metric space Y and any X C Y, a C-Lipschitz 
mapping from X to the plane with the d,, metric can be extended to a 
C-Lipschitz mapping defined on Y; see, e.g., [9]). 

For later use, we observe that if z is randomly chosen from the uniform 
probability distribution on [0, 2c]?, then for any fixed x € M;;, we have 


(EN Mi;) 
4c? 

(since for each y € EN Mi;, 9z—y+(c,c) Sends y to x). 
Consider Z = {(z;; 1,7 =1,2,...,N); 2 € [0,2c]?} with the product 

probability measure, and let z € Z be a random element. Let 9, : G—~ G 


Prob(a € g2(E M Mi;)) > (1) 
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be the 2-Lipschitz mapping whose restriction on each s;; is gz,,. Further we 
define a 1-Lipschitz mapping f; : G — [0, 3c]? corresponding to a “folding” 
of the grid G (as if G were a piece of paper and we wanted to fold it to one 
little square). Formally, if p € s;; is a point in G with displacement (x, y) 
from the lower left corner of s;;, we set 


(x, y) for i, 7 odd, 
Oe (x, 3c — y) for i odd, 7 even, 
WP) = (3c — x,y) for i even, j odd, 


(3c —x,3c—y) for i,7 even. 


Finally, we let fo be the retraction of the plane onto the grid G, and fo : 
(0, 3c]? > Q be the mapping translating the middle c x c square of [0, 3c]? 
onto Q = [0,c]? and contracting the rest of [0,3c]? onto the boundary of 
Q. Both fo, fo are 1-Lipschitz. We put f, = foo fi og, 0 fo. This is a 
2-Lipschitz map. 

Fix any point x € [0,c]?, and let X;; be the 0/1 indicator variable for the 
event x € fe(ENMs5) (this is a random variable depending on the choice of z), 


and set X = 1 X;;. From (1), we get for the expectation of each X;; 


1,j=1 
1 
E Xi; > 2 —.}? (EN Mi;), 
and so 
N 
1 1 
EX = E X;; > > ma 
by I= 402 


As the variables X;; are independent, we have 


N N 
VarX = S> VarXiz < | EX}, =) EX; =EX. 
ij=l ij=l ij 


From Chebyshev’s Inequality we thus get 

Var X wool. 1 

(EX)? — E x 

The event X #0 means « € f2(MNE), where x is our fixed point of Q. Con- 
sidering all x’s simultaneously and using Fubini’s Theorem, we get that there 


exists a Z € Z such that A?(fz,(ENM)) > (1—e0)A?(Q). This concludes the 
proof of Lemma 6. 


Prob(X = 0) < Prob(|X —EX|>EX)< 
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A Remark on Transversal Numbers 
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“What does the Hungarian parrot say?” 
“Log. Log log log log ... ” 
(Riddle. Folklore.) 


1. Introduction 


In his classical monograph published in 1935, Dénes K6nig [23] included one 
of Paul Erdés’s first remarkable results: an infinite version of the Menger 
theorem. This result (as well as the Konig-Hall theorem for bipartite graphs, 
and many related results covered in the book) can be reformulated as a 
statement about transversals of certain hypergraphs. 

Let H be a hypergraph with vertex set V(H) and edge set E(H). A subset 
T C V(A) is called a transversal of H if it meets every edge E € E(H). 
The transversal number T(H) is defined as the minimum cardinality of a 
transversal of H. Clearly, 7(H) > v(H), where v(H) denotes the maximum 
number of pairwise disjoint edges of H. In the above mentioned examples, 
T(H) = v(#) holds for the corresponding hypergraphs. However, in general 
it is impossible to bound 7 from above by any function of v, without putting 
some restriction on the structure of H. 

One of Erdés’s closest friends and collaborators, Tibor Gallai (who is also 
quoted in K6nig’s book) once said: “I don’t care for bounds involving log n’s 
and log log n’s. I like exact answers. But Paul has always been most interested 
in asymptotic results.” In fact, this quality of Erdés has contributed a great 
deal to the discovery and to the development of the “probabilistic method” 
(see [4, 14]). 

The search for “exact answers” (e.g. to the perfect graph conjecture 
of Berge [7]) has revealed some important connections between transversal 
problems and linear programming that led to the deeper understanding of the 
Konig-Hall-Menger-type theorems. It proved to be useful to introduce another 
parameter, the fractional transversal number of a hypergraph, defined by 


r(H)=min S> t(x), 


«e€V(H) 
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where the minimum is taken over all non-negative functions t : V(H) —> R 
with the property that 


» t(a) > 1 for every E € E(#). 
zeke 


Obviously, 7(H) > 7*(H) > v(H), and 7*(#H) can be easily calculated by 
linear programming. (See [25] and [30].) 

At the same time, the probabilistic (or shall we say, asymptotic) approach 
has also led to many exciting discoveries about extremal problems related to 
transversals (e.g. Ramsey-Turdn-type theorems, property B). It was pointed 
out by Vapnik and Chervonenkis [32] that in some important families of 
hypergraphs a relatively small set of randomly selected vertices will, with high 
probability, be a transversal. They defined the dimension of a hypergraph as 
the size of the largest subset A C V(H) with the property that for every 
BCA there exists an edge E € E(H) such that EM A = B. Adapting the 
original ideas of [32] and [21], it was shown in [22] (see also [27]) that 


T(H) < (1 + o(1)) dim(H)7*(H) log r*(H), (1) 


as T* —> oo, and that this bound is almost tight. 

Ding, Seymour and Winkler [11] have introduced another parameter of a 
hypergraph, closely related to its dimension. They defined \(H) as the size 
of the largest collection of edges {£1,...,E,} C E(H) with the property 
that for every pair (E;,E;),1< ij < k, there exists a vertex x such that 
xe EE; but « ¢ E, for any h F¥ i,j. Combining (1) with Ramsey’s 
theorem, they showed that 


MH) + 2) ° @) 


r(H1) < 6X) + (4) ( 


holds for every hypergraph H. 

As far as we know, Haussner and Welzl [21] were the first to recognize 
that (1) has a wide range of interesting geometric applications, due to the 
fact that a large variety of hypergraphs defined by geometric means have low 
Vapnik-Chervonenkis dimensions. 

The aim of this note is to illustrate the power of this approach by 
two examples. In Sect.2 we show that (2) easily implies some far-reaching 
generalizations of results of Erdés and Szekeres [15, 16]. In Sect. 3, we use (2) 
to extend and to give alternative proofs for some old results of Gyarfas and 
Lehel (see [19, 20, 24]) bounding the transversal numbers of box hypergraphs. 


2. Covering with Boxes 


Given two points p,g,€ R%, let Box[p,q| be defined as the smallest box 
containing p and q, whose edges are parallel to the axes of the coordinate 
system. The following theorem settles a conjecture of Bardny and Lehel [6], 
who established the first non-trivial result of this kind. 
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Theorem 1. Any finite (or compact) set P C R* contains a subset with at 
most 22°"” elements, {pill <i< ya such that 


got? 


PC lf Box[p;, pj]. 
ij=l 


Proof. Let H be a hypergraph on the vertex set 
V(H) = {Boxlp, q|lp,q € P}, 
defined as follows. Associate with each point r € P the set 
E, = {Box[p, q]|r € Box[p, q]}, 


and let E(H) = {E,.|r € P}. 

Clearly, E,E, #0 for any p,q € P, because Box{p, gq] € E,E,. Hence, 
v(H) = 1. 

According to a well-known lemma of Erdés and Szekeres [15], any 
sequence of k? + 1 real numbers contains a monotone subsequence of length 
k +1. By repeated application of this statement, we obtain that any set of 
ge 4] points in R¢ has three elements p;,p;,p% with py € Box[p;, p;]. This 
immediately implies that 


MH) < 22°” 


Indeed, for any family of more than 2?" ' edges Ep,,...Ep, € E(H), one can 
choose three distinct indices i, 7,k with py € Box[pi, pj], which yields that 


Ep, Bp; © Ep, 
Thus, we can apply (2) to obtain 
7(H) < 6A2(H) (\(H) +1)° < 2?"”," 


and the result follows. 


As was shown in [6], the bound 22"*° in Theorem 1 is nearly optimal. 
In fact, the above argument yields a slightly stronger result. 


Theorem 2. Let P C R®@ by any compact set, and let B be any family of 
boxes in parallel position with the property that for any (v+1)-element subset 
P' CP there is a box B € B which covers at least two points of P’(d,v > 1). 

224 +v a 
Then one can choose at most ( Fs ) 
will cover P. 


members of B such that their union 


In [16], Erdés and Szekeres proved the following. 


Lemma 1. Every set P C R? with at least 2" elements contains three points 
P1,P2,p3 such that <pi,p2,p3 > 7 (1— Z)- 
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Our next result, which improves a theorem of Bardny [5], can be regarded 
as a generalization of Lemma 1. 


Theorem 3. Let d be a positive integer, ¢ > 0. Every finite (or compact) set 
P CR? has a subset of at most g(e/e)* elements, P’ = {pi,po,...}, with 
the property that for any p € P \ P’ there exist p;,p; € P’ satisfying 


<pipp; = 17 —€. 
(Here c < 8 is a constant.) 


Proof. For d > 2, « > O fixed, let us cover the unit hemisphere centered 
at O € R@ with (4/e)4~! spherical (d — 1)-dimensional simplices $1, S2,... 
such that the diameter of each S; is at most ¢/2. Let hit,...,htaq denote 
the hyperplanes induced by O and the ((d — 2)-dimensional) facets of S;. A 
parallelotope whose facets are parallel to hi1,..., hea, respectively, is called a 
box of type t. The smallest. box of type t containing p,q € R®% will be denoted 
by Box; |p, q]- 

For any p,q € R®, choose an index t such that pq is parallel to Os for 
some s € S;, and let Box[p, g] = Box;|[p, q]. Notice that if r € Box [p,q] then 
<prq > 7 —€. 

Just like in the previous proof, define a hypergraph H by 


V(#) = {Box[p, qlp,¢ € P}, 
E(H) = {E,|r € P}, 


where E, = {Box[p,q]|r € Box[p,q]}, and observe that it is sufficient to 
bound the transversal number of H. Clearly, v(H) = 1. 

By the definition of A(H), one can select A(H) = A elements pj,...,p € 
P with the property that any two of them is enclosed in a box of some type, 
which does not cover any other p;,. More precisely, for every 1 <i<j <A 
there exists t(i, 7) < (4/e)4~! such that 


{p1,.--, Pr} N Boxy,3) (pi, pj] = {ps,p;}- 

Obviously, p; and p; are two antipodal vertices of Box;(;,;)[pi, pj], and every 
box has 2¢~! pairs of antipodal vertices. Let us color the segments p;p;(1 < 
i<j <A) with (4/e)4—!27-! colors according to the value of ¢(i,j) and to 
the particular position of the diagonal pip; within Box;(;,;) |p, pj]. It is easy 
to see that the segments of the same fixed color form a bipartite subgraph 
of the complete graph K, on the vertex set p1,...,p,. Hence the chromatic 
number of Kk), 


d—19d-1 


MH) < 20/9) 


and the result follows from (2). 


It is not hard to see that the bound in Theorem 3 is asymptotically tight, 
apart from the exact value of c (see [5, 13]). 
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Combining these observations with an analogue of Turan’s theorem for 
hypergraphs, we immediately obtain the following result related to a problem 
of Conway, Croft, Erdés and Guy [8]. 


Corollary 1. There exists a constant c > 0 such that, for any set of n 
distinct points p1,...Dyn € R4, the number of triples i < j < k for which 
<pipjpe, > 7 —€, is at least [n3 /2(e/e)*" | Moreover, apart from the value of 
c, this bound cannot be improved. 


Finally, we mention another straightforward generalization of Lemma 1. 


Theorem 4. Let P be any set of at least Kle/eye™ points in R*, where c 
is a suitable constant. Then one can find po,...,pe € P such that they are 
“almost collinear”, i.e., <p;-1pipiz1 > 7 — © for every i(1<i<k). 


3. Gallai-Type Theorems 


Many problems in geometric transversal theory were motivated by the 
following famous question of Gallai. Given a family of pairwise intersecting 
disks in the plane, what is the smallest number of needles required to pierce 
all of them? (The answer is three. See [9, 10, 12, 17].) 

First we show that (2) implies the following result of Gydrfds and Lehel. 


Theorem 5 ([20]). For any positive integers k and v, there exists a number 
f =f(k,v) with the following property. Let H be any finite family of subsets 
of R such that each of them can be obtained as the union of at most k intervals. 
If H has nov+1 pairwise disjoint members, then all of its members can be 
pierced by at most f points. 


Proof. In order to apply (2), we have to bound A(#). Let F1,...,£) be some 
members (edges) of H such that, for any i < j, E; 1 E; has a point x;; which 
does not belong to any other E;, (h 4 i, 7). Write each E; (1 <i < A) as the 
union of & intervals, 


A, = et ieee 


If aj; € Lip ON Ijq for some i < 7, then (£;, E;) is called a pair of type (p, q). 
(A pair may have several different types.) 

It is easy to check that there are no four edges E; such that all () =6 
pairs determined by them are of the same type. Thus, 


A < Ryz2(4), 


where R,(t) denotes the smallest number R such that any complete graph 
of R vertices, whose edges are colored with s colors, has a monochromatic 
complete subgraph of t vertices (cf. [18]). Hence, the theorem is true with 
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Ry2(4) + ae 


Vv 


f(k,v) < 6( 


Theorem 5 does not generalize to subsets of the plane that can be obtained 
as the union of k axis-parallel rectangles. Indeed, let H = {EF;|1 <i < n}, 


where 


Theorem 6. Let F' be a family of open domains in the plane such that each 
of them is bounded by a closed Jordan curve, and any two of them share at 
most two boundary points. Furthermore, let H be a finite set system, whose 
every element can be obtained by taking the union of at most k members of 
F. If H has nov+1 pairwise disjoint elements, then all of its elements can 
be pierced by at most g(k,v) points (where g depends only on k and v). 


E; = {(2,y) €R? |0<2,y<n and min(|x — 4], ly — J) < 


ALE 


Then v(H) = 1, while \(H) = 7(H) =n. 
However, one can easily establish the following. 


Proof. Pick elements (edges) of H, 
E,=IU...Ulix (lip €F,1 SiS \1<0<h), 
and suitable points 
wij © (Ei Ej) \ Ungig En, 


as in the previous proof. After defining the type of a pair (E;,E;),i <j 
in exactly the same way as above, now one can argue that there are no 
6 edges E; such that all the ($) = 15 pairs determined by them have the 
same type (p,q). Assume, for contradiction, that e.g. F,..., £6 satisfy this 
condition for some p # g. Then any Jj, (1 < 7 < 3) and any Jj, (4 <j < 
6) have a common interior point (z;;) which is not covered by any other 
Ex (k 4 i,j). We can conclude (by tedious case analysis) that there exist 
pairwise disjoint connected open subsets Tip C Ip A <i < 3), Di C Tyq 
(4 <j <6) such that every Jj, and Ij, share a common boundary segment. 
This contradicts Kuratowski’s theorem on planar maps. The case p = q can 
be treated similarly. 

Thus, \ < R,z2(6) and the result follows. We could also apply Theorem 1.1 
of Sharir [31] to deduce A < R,2(c) with a much larger constant c > 6. 


Theorem 6 can be applied to the family F¢ of all homothetic copies of a 
convex set C’' in the plane. The special case when C is a convex polygon with 
a bounded number of sides was settled by Gyarfas [19]. (An easy compactness 
argument shows that C does not need to be strictly convex.) 

For any hypergraph H and for any integer t > 1, let »,(H) denote the 
maximum number of (not necessarily distinct) edges of H such that every 
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vertex is contained in at most t of them. Furthermore, let ,(H) be the size 
of the largest collection of edges {|i € I} C E(H) with the property that 
for any t-tuple J C I there exists x; € V(H) such that 


ay € (n s) \( UE 
ie J ig J 
Clearly, 1,(H) = v(H) and A2(H) = X(#). 

Ding, Seymour and Winkler [11] have established an upper bound for 
T(H) in terms of %(H) and A141(H), for any fixed t > 1. Applying their 
result with t = 2, we obtain the following generalization of Theorem 5 for the 
plane. 


Theorem 7. Let H be a finite family of open sets in the plane such that 


(i) Every member of H is bounded by at most k closed Jordan curves; 
(ii) Any two distinct members of H have at most € boundary points in 
common. 


Assume that among any v-+1 members of H there are three with non-empty 
intersection. Then all members of H can be pierced by at most f(k, @,v) 
points, where f does not depend on H. 


In higher dimensions we obtain e.g. the following result. 


Theorem 8. Let H be a finite family of not necessarily connected polyhedra 
in R4(d > 2). Assume that every member of H has at most k vertices, and 
that among any v +1 members of H there are d+ 1 whose intersection is 
non-empty. Then all members of H can be pierced by at most g(d,k,v) points, 
where g does not depend on H. 


The special case of Theorem 8, when every member of H is the union of 
a bounded number of axis-parallel boxes, was proved by Lehel [24]. 


4. Concluding Remarks 


Alon, Brightwell, Kierstead, Kostochka, and Winkler [1] further analyzed the 
hypergraph H defined in the proof of Theorem 1. Using the same corollary 
of the Erdés-Szekeres theorem [15] as we did, they gave an upper bound on 
T*(H). Using (1) rather than (2), they obtained a slight improvement on 
Theorem 1: Instead of a subset of size gn there exists a subset of size 


924+d+log d+log log d+O(1) 
with the required property. Here log stands for the natural logarithm. 


Palvolgyi and Gyarfds [28] fine-tuned the above arguments and further 
improved the bound to 
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g2*- ‘: +log d+log log d+ O(1) 


Many related transversal problems concerning hypergraphs defined by 
geometric means and hypergraphs formed by the neighborhoods of the 
vertices of a graph were studied in [1-3] and [26], respectively. In most cases, it 
turns out that these hypergraphs or some others derived from them have small 
X or small VC-dimension. Therefore, one can apply the estimates (1) or (2). 
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Summary. We use the Gram matrix to prove that the largest number of points 
in R¢ such that the distance between all pairs is an odd integer (the square root of 
an odd integer) is < d+2 and we characterize all dimensions d for which the upper 
bound is attained. We also use the Gram matrix to obtain an upper bound for the 
smallest angle determined by sets of n lines through the origin in R°. 


1. Introduction 


Ann Xn symmetric matrix M is positive semi-definite if (Ma,«) > 0 for all 
vectors x € FE”. Equivalently, M is positive semi-definite if it is symmetric 
and its eigenvalues are nonnegative. The Gram Matrix (Grammian) of the set 
of vectors {uj,...,Un} C E% is the nxn matrix M defined by M = ((uj, u;)). 
This matrix is a positive semi-definite matrix and its rank is the dimension of 


the subspace spanned by {w1,..., Un}. It is well known that if N is a positive 
semi-definite matrix of order n x n and rank d, then N is the Grammian 
of a set of n vectors {u1,...,un} C E%. In other words, N = ((uj,u;)). 


This bi-directional relation between vectors, their inner products and the 
Grammian has provided a powerful tool for solving problems in combinatorics 
and combinatorial geometry, it plays a central role in the extensive work of 
Seidel on equiangular lines, it was used in [4] to solve a problem of L. Lovasz 


where they proved that W/2n? > max || 2 uy|| > ce, the max taken over 


all families of n almost orthogonal unit ene in R”, that is all n tuples of 
vectors such that among any three vectors there is at least one orthogonal 
pair, it was used in [9] to answer a question of P. Erdés showing that the 
maximum size of a set of almost orthogonal lines in R@ is 2d and the list can 
go on and on. One could fill a large volume exploring the many applications of 
the Gram matrix. The work in this note was motivated by the following simple 
and attractive problem that appeared in the Nov. 1993 54th W. L. Putnam 
Mathematical Competition (problem B-5): Can four points in the plane have 
pairwise odd integral distances? The answer is NO and the Gram matrix 
provides a very short proof of that. Surprisingly, we cannot do much better if 
we relax the distance requirement and permit points to have distances whose 
square is an odd integer. We prove that if the square of all distances among 
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pairs of n points in R@ is an odd integer then n < d+ 2. We show that the 
upper bound is attained if and only if d = 2 mod 4. A slight modification 
of this proof yields also an alternative proof to a theorem of R. L. Graham, 
B. L. Rothschild and E. G. Strauss [3] that d+ 2 points in R¢ with odd 
integral distances exist if and only if d= 14 mod 16. We conclude by using 
the Gram matrix to obtain an upper bound for the smallest angle determined 
by sets of n lines through the origin in R*. Cases for which this upper bound 
is attained are discussed and also some related open problems. In Sect. 2 we 
discuss distances among points and related open problems and in Sect. 3 we 
discuss angles among lines. 


2. Odd Distances Among Points in R4@ 


Theorem 1. Let pi,...,p~n be n points in R¢ such that ||p; — p;||? = 1 
mod 2 ifi #7. Then 


_fd+2 ifd=2 mod4 
n 
~ |d+1 ifd#2 mod4 


and this is best possible. 


Proof. Let ug = pr — pi and let M = ((ux,u;)) be the Gram Matrix of the 
vectors {uz}. M is a positive semi-definite matrix of rank < d and order 
nm —1, hence if m—1 > ddet(M) = 0 and therefore det(2M) = 0. From 
2(ui,ttj) = lluall? + fluall? — jus — ujl? and jes — wll? = [lpi — pill? = 1 
mod 2 we deduce that 2(u;,u;) is an odd integer. From the above discussion 
it follows that: 


2 a1,2 Q1,3 Para Q1jn+1 
Q1,2 2 a2,3 Para a2.n+1 
A=2M mod4= ‘ . : (aij = Aji = +1). 
G@n41,1 Ant1,2 * «+» 2 


Let A;;(x) be the matrix derived from the matrix A by replacing a;; and aj; 
by x. Consider the quadratic p(x) = det(A;;(x)) = ax? + bx +c. Since all 
entries in A are integers clearly, a, b and c are also integers. Furthermore, 
since A,;(x) is symmetric we also have b = 2k. Hence p(1) — p(—1) = 4k. 
This means that by replacing any pair of symmetric —1’s by 1’s the value of 
the determinant mod 4 remains unchanged or det(A) mod 4 = det(J + J) 
mod 44 (the matrix J is the square matrix with J;; = 1). Clearly 1 is an 
eigenvalue of J + J with multiplicity n — 2 and since the sum of the entries 
in each row is n, n is the remaining eigenvalue. Therefore det(A) mod 4 = 
det(J + I) mod 4 = n mod 4. As noted above, det(A) = 0 ifn >d+1 
hence we must haven =0 mod 4. If n > d+3 and if we had n points in R? 
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with mutually odd integral squared distances then n = 0 mod 4. But then 
removing one of the points will yield a set of n — 1 such points in R? while 
n—12#0 mod 4 and n—1>d+1 which is impossible. Hence n < d+ 2. 
It remains to show that for all d= 2 mod 4 it is possible to construct d+ 2 
points in R? with mutually odd integral squared distances. The following 
construction using the Gram matrix will do the job. 

Let d = 4k+2 and let A be a matrix of order 4k+3 with A;; = 2k+1 and 
Ajj = —zi # j. Since A — (2k + 3)I = —5J, (2k + 3) is an eigenvalue of A 
and since J has rank 1 the multiplicity of this eigenvalue is 4k + 2. The sum 
of the entries in each row is zero, hence 0 is the remaining eigenvalue of A. 
This implies that rank(A) = 4k + 2 =d that A is positive semi-definite and 
can be written as: A= M-M*" where M is a (4k4+3) x (4k +2) matrix. The 
rows of M will determine 4k + 3 points {p;} in R% such that: ||p;|/? = 2k +1 
and ||p; — p;||? = ||pil|? + ||p;|| + 1 = 44 + 3. These points together with the 
origin yield 44 + 4 = d+2 points as claimed. Geometrically, the d+ 2 points 
consist of the d+ 1 points of a regular simplex with side V4k +3 and its 
center. To see this, recall that the length of the edge of the regular simplex 


inscribed in the unit sphere in R4 is /2+2/d = \/ 44 if d = 4k+2. Hence 


if we inflate the regular unit simplex by 2k + 1 the distance from the center 
to the vertices will be ~2k+1 and the distance between the other vertices 


will be 4k + 3. 


A slight modification of the above proof yields an alternative proof to the 
following theorem of Graham, Rothschild and Strauss [3]. 


Theorem 2. For the existence of n+ 2 points in R” so that the distance 
between any two of them is an odd integer it is necessary and sufficient that 
(n+2)=0 mod 16. 


Proof. As in the previous proof, let {po,..., Pr4i1} be n+2 points in R” with 
mutual distances ||p; — p;|| = 1 mod 2. Let uj = p; — po, i = 1,...,n 41. 
The matrix M = ((u;,u,;)) has rank < d and order n+ 1. Hence if n > d we 
have: 


det(M) = 0 (1) 
det(M) = 0 = det(2M) =0 => det(2M) mod 16=0. (2) 
Since ||w;, ws || = ||pi, p;|| = 1 mod 2 we have: 
Jjui,uj|[? =1 mod 8 (3) 
(ui, uj) = lua? + [lug — [las — uj? = 1 mod 8. (4) 


Since ||u,;|| = 1 mod 2 we have: 


2(uj;, uz)? = 2|lus||? =2 mod 16. (5) 
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From (4) and (5) we get: 


2 @1,2 Q1,3 +++ Q1jn41 
a2,1 2 42.3... A2n41 
2M mod 16= . 
Ant1,1 An41,2 ° «+» 2 


=A (where aj; = aj; = 1 or 9). 


Again we let A;;(X) be the matrix derived from the matrix A by replacing 
aj; and aj; by x and consider the quadratic p(x) = det(Aj;(x)) = az? +br+ce. 
As in Theorem 1, a, b and ¢ are integers and b = 2k. Hence p(9) — p(1) = 
80a+16k. This means that replacing any pair of symmetric 9’s by 1’s the value 
of the determinant mod 16 is unchanged, in other words, det(A) mod 16 = 
det(J +I) mod 16. 

Since the eigenvalues of J +I are (n + 2) and (1){"} we have: 


det(J +I) mod 16=(n+2) mod 16. 


Thus det(M/) can be 0 only if (n+ 2) =0 mod 16. Again, the Gram matrix 
can be used to describe a construction of d+ 2 points so that the distance 
between any two of them is an odd integer. As in Theorem 1, it will suffice 
to construct a positive semi-definite matrix M of order d+ 1, rank d, with 
be the matrix of order 16k — 1 shown below. 


(4k 1)? Ge" k=? 
2 2 
(4k—1)? (6k — 1)? 8k2—8k4+1 | | 8k?—8k+1 
: 8k? —8k4+1 ; 
M= 2 
Cis 8k" —8k+1 : a (6k = 1)? 


A simple but tedious computation shows that (8k — 1) times the top row is 
the sum of the remaining 16k—2 rows. Hence 0 is an eigenvalue of M. Clearly, 
((6k — 1)? — 8€ —28kt1) is an eigenvalue with multiplicity 16k — 3 and using 
the trace we can see that the remaining eigenvalue is also positive. Hence 
is positive semi-definite of rank 16k — 2. Since 2(6k — 1)? — (8k? —8k+1) = 
(8k — 1)? and (6k — 1)? + (4k — 1)? — (4k — 1)? = (6k — 1)? is the desired 
matrix. The matrix M is the Gram matrix of the vectors determined by the 
d+ 2 points constructed in [3]. 


Remark 1. Unlike the d+ 2 points constructed in Theorem 1, this set is a 
3-distance set. H. Harbroth (private communication) showed that there is no 
2-distance set of d+2 points in R¢ so that the two distances are odd integers. 
On the other extreme it might be interesting to construct d+2 points so that 
all distances (squared distances) are distinct odd integers. Both theorems raise 
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some interesting questions already in the plane. If we define a graph whose 
vertices are the points in the plane and connect two vertices by an edge if their 
distance (distance squared) is an odd integer then this graph does not contain 
a K4 (Ks). Do etther of the two graphs have a finite chromatic number? 
(Clearly if the odd distance-squared graph has a finite chromatic number so 
does the odd distance graph). This problem is similar to Nelson’s classical 
problem of coloring the plane so that points at distance 1 get distinct colors. 

As noted by P. Erdés, Turan’s theorem implies that the maximum number 
of distances among n points in the plane that are odd integers is < n?/3. On 
the other hand, one can easily construct n points on the line with n?/4 odd 
distances. Erdés asked whether it is possible to construct n points in the plane 
so that more than n?/4 of the distances will be an odd integer. This question 
as well as the similar question with odd squared-distances is also interesting 
in higher dimensions. 


3. Angles Among Lines in R@ 
Let LD = {Ii,..., Ln} C R% be a set of lines through the origin. Let a(L) 


denote the smallest angle among all angles determined by pairs of distinct 
lines from L. Let a(n, d) = sup{a(L)|L = {Ii,..., Ln} Cc RF. 


d(n—1)° 
[2] For each n and d there is a set of lines L = {Ly,...,Ln} C R® such 
that cosa(L) = cosa(n, d). 


Theorem 3. /1] cosa(n,d) > J wo 


Proof. We assume that n > d. Let u; be a unit vector along the line D; and 


let M be the Grammian of {u1,...,un}. Note that cos? a(L) = max(u;,u;)?. 
Since M is positive semi-definite of rank < d it has at most d nonzero 
eigenvalues {A1,...,Aa$, A; > 0. Since the u,’s are unit vectors, Mj, = 1 
and therefore: 
d 
trace(M) =n = > ri (6) 
i=1 
non d 
trace(M*) = by (ui, uj)” = a a (7) 
i=1 j=1 i=1 
From (6): 
d 
2 nz 
od (=) ee La 
L¥24(4) =5 
t=1 
From (7): 
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Or: 

’ —d —d 
n(n — 1) max((u;,u;)”) > “ n= mn ) => max((uj,uj)”) > a 
To prove (7) note that maxcos?a(L) = max{max((u;,u;)”)} where the 


inner maximum is taken over the (5) pairs of distinct vectors and the outer 
maximum is taken over all n-tuples of unit vectors in E% (which is a closed 
bounded set in EL"). Since this is a continuous function the maximum is 
attained. 


Remark 2. Note that if equality holds in (3) then (uj, uj)? = rc) or the 
lines form a set of equiangular lines. It is well known that the existence of n 
equiangular lines in E% with angle 0 is equivalent to the existence of a graph 
with n vertices so that the smallest eigenvalue of its Seidel matrix is — 555 
and has multiplicity n—d. Already in E? it seems that the determination of the 
exact value of a(n,3) may be a very difficult problem. For instance, it is not 
possible to construct a set L of 5 equiangular lines in E® with cosa(L) = we 
even though we can construct a set of 5 equiangular lines. Indeed the only 
way to do it is by taking 5 of the 6 diagonals of the icosahedron. The angle 


determined by any pair of these lines is: arccos ae Raphael Robinson (private 


communication) asked whether a(5,3) = a(6,3), or more generally if there 
are integers n for which a(n,3) = a(n+1,3)? For spherical caps there are a 
few values for which this is the case, L. Danzer [6] proved that the widest 
angle for 11 spherical caps is the same as for 12 (arccos # = 63°26/5.8”). 


He also showed that for 10 spherical caps the widest angle is > 66°8'48.3” 
hence spherical caps cannot be used to prove that a(5,3) = a(6,3), for more 
information see [7] and [8]. Since the maximum number of equiangular lines 
in E® is 6, the bound also will not be attained for n > 6. It will also be 
interesting to know whether there are values of n > 5 for which the extremal 
configuration is not unique. 

For n = d+1 we get cos(a(d + 1,d)) = 4 and this is always attained 
by the d+ 1 lines connecting the center of the regular simplex to its vertices. 
There are many sporadic cases where the bound is attained. For instance 
a(16,6) = 3 and there are 16 equiangular lines in R° with angle arccos($); 


3 


a(176, 22) = a(276, 23) = z and in both these cases there are appropriate 


sets of equiangular lines (see [5/). A particularly attractive case is the case 


of 2d lines in R%. In this case cosa(2d,d) = / Woe — as This 
bound is attained for d = 3,5 but not for d = 4. Indeed, there is no Seidel 
Matrix of order 8 with smallest eigenvalue —/7. On the other hand for 
d = 5 it is possible to construct 10 lines in R° so that the angle between 
any distinct pair is arccos $: to see this consider the Seidel matrix of the 
Petersen graph. It’s eigenvalues are (3)**}, (—3)'®} hence this matrix can be 
used to construct 10 equiangular lines in R° with angle arccos :. Recall that 
a conference matrix of order n is a symmetric n x n matrix A with 0’s along 
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the diagonal, +1 elsewhere satisfying A? = (n —1)I (/1]). Its eigenvalues 
are +/n—1. Hence if a conference matriz of order 2d exists it will yield 
2d equiangular lines in R% with angle a(2d,d). A necessary condition for the 
existence of a conference matrix of order n is thatn = 2 mod 4 andn—1 
is a sum of two squares (J. H. van Lint and J. J. Seidel). There are exactly 
4 distinct conference matrices of order 26 (Weisfeiler [10]), there are at least 
18 conference matrices of order 50. In each one of these cases a(2d,d) will 
be attained by distinct configurations. Conference matrices of orders 6, 14, 
30, 38, 42, 46 are known to exist hence in all corresponding dimensions the 
bound for the largest angle is attained. 


Acknowledgements The author is indebted to Branko Grtinbaum and Victor 
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Summary. Two finite sets of points in the plane are called mutually avoiding if 
any straight line passing through two points of any one of these two sets does not 
intersect the convex hull of the other set. For any integer n, we construct a set of 
n points in general position in the plane which contains no pair of mutually avoiding 
sets of size more than O(,/n) each. The given bound is tight up to a constant factor, 
since Aronov et al. [1] showed a polynomial-time algorithm for finding two mutually 
avoiding sets of size 2(,/n) each in any set of n points in general position in the 
plane. 


1. Introduction 


Let A and B be two disjoint finite sets of points in the plane such that 
their union contains no three points on a line. We say that A avoids B if no 
straight line determined by a pair of points of A intersects the convex hull 
of B. A and B are called mutually avoiding if A avoids B and B avoids A. 
In this note we investigate the maximum size of a pair of mutually avoiding 
sets in a given set of n points in the plane. 

Aronov et al. [1] showed that any set of n points in general position in 
the plane (ie., no three points lie on a line) contains a pair of mutually 
avoiding sets, both of size at least \/n/12. Moreover, they gave an algorithm 
for finding such a pair of sets in time O(n log). In Sect. 2 we construct, for 
any integer n, a set of n points in general position in the plane which contains 
no pair of mutually avoiding sets of size more than 11,/n each. 

Mutually avoiding sets in a d-dimensional space are defined similarly. Any 
set of n points in general position in R? contains a pair of mutually avoiding 
sets, both of size at least O(nF=a1) (see [1]). On the other hand, our method 
described for the planar case in Sect. 2 yields a construction of sets of n points 
in R¢ with no pair of mutually avoiding sets of size more than O(n! 1/4), 

Now we recall some definitions from [1]. A set of line segments, each 
joining a pair of the given points, is called a crossing family if any two line 
segments intersect in the interior. Two line segments are called parallel if 
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they are two opposite sides of a convex quadrilateral. In other words, two 
line segments are parallel if their endpoints form two mutually avoiding sets 
of size 2. It is an easy observation that any pair of avoiding sets of size s 
can be rebuilt into s pairwise parallel line segments or into a crossing family 
of size s. Aronov et al. [1] used this observation and the above result on 
mutually avoiding sets for finding a crossing family of size 2(,/n) and a set 
of 2(./n) pairwise parallel line segments. 

The result on pairwise parallel line segments was strengthened and 
extended to a higher dimension by Pach. Pach [8] showed that any set of 
n points in general position in R¢ contains at least Q(n!/¢) d-dimensional 
simplices (i.e., (d + 1)-point subsets) which are pairwise mutually avoiding. 

In Sect. 3 we show a relation between mutually avoiding sets and Erdés’ 
well-known empty-hexagon-problem. 


2. Sets with Small Mutually Avoiding Subsets 


For a finite set P of points in the plane, let g(P) denote the ratio of the 
maximum distance of any pair of points of P to the minimum distance of 
any pair of points of P. For example, if P is a square grid ./n x \/n then 
q(P) = V2(/n — 1). In this section we show: 


Theorem 1. Letc> 0 be a positive constant. Then any set P of n points in 
the plane satisfying q(P) < c/n contains no pair of mutually avoiding sets 
of size more than [2(/17 + L)eV/n] each. 


One of the basic results about covering says that for any integer n > 2 
there is a set P of n points in the plane with q(P) < con, where co = 


\/2V3/m = 1.05 (see [5]). Such a set P can be found as the triangle grid 
inside a disk of appropriate size. If we slightly perturb the points of P, we 
obtain a set in general position still satisfying q(P) < co./n. According to 
Theorem 1 this set contains no pair of mutually avoiding sets of size more 
than 11\/n. (It is obvious for n < 100. For n > 100 we use the estimation 


[2(V17 + 1l)coV/n] < 11Vn.) 


Proof of Theorem 1. Let P be a set of n points in the plane satisfying g(P) < 
c\/n. Without loss of generality, we may and shall assume that the minimum 
distance in P is 1. Let A and B be two mutually avoiding subsets of the set P. 
Define Cartesian coordinates so that for some positive constant d € (0, $c/n) 
all points of AU B lie in the closed strip between the two vertical lines 
p:x=-—dand q: x =d, and one of the sets A and B, say A, has a point on 
the line p and a point on the line g. Moreover, let the topmost point bo of the 
set B lie on the z-axis and let the set A lie “above” the set B (i.e., the set 
A lies above any straight line connecting two points of B). Since bo lies on the 
x-axis, all points of AU B lie between the two horizontal lines r : y = —c,/n 
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Fig. 1 Auxiliary lines and points 


and s: y = c\/n. Define three points u, v, w as those points in which the line 
s intersects the line p, the y-axis, and the line gq, respectively (see Fig. 1). 

For each point 6 € B, let f(b) be that point in which the line segment 
by intersects the x-axis. Now we show that, for any two points b,b’ € B, the 
distance | f(b) f(b’)| between f(b) and f(b’) is greater than Witney 

If the line 6b’ is horizontal then | f(b) f(0’)| > $|bb’| => 5 > Witenes" 
If the line 6d’ is not horizontal then it intersects the line s in some point g 
outside the segment uw (see Fig. 2). Thus |gv| > d. Without loss of generality, 
assume that 6 is closer to the line s than b’. Let z be that point on b’v, for 
which bz is horizontal. 

Now estimate 


[oe ol, ioe 
al |gu| + |vb’| \gv| + /Qe/n)? + a2 +e 

d . d _ d 
d+V/Qe/n +e ~ a4 leat eva)? + evn)? (3 + Bevin) 


and 


[bz] = - [gu] > [b8'| - 


d 
(V17 + l)evn 


Since the points f(b), b€ B a placed on a line segment of length 2d, the 
size of B is at most [2d/ WT B wrapegs | = [2(V17 + Levn]. 


If O)F(H)| > S [bal > 
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Fig. 2 Auxiliary points and triangles 


3. Relation Between Mutually Avoiding Sets 
and the Empty-Hexagon Problem 


Let A be a finite set of points in general position in the plane. A subset S of 
A of size k is called convex if its elements are vertices of a convex k-gon. If 
S is convex and the interior of the corresponding convex k-gon contains no 
point of A, then S is called a k-hole (or an empty k-gon). The classical Erdés- 
Szekeres Theorem [4] (1935) says that if the size of A is at least (7"7;') +1 
then A contains a convex subset of size k. 

Erdés [3] asked whether the following sharpening of the Erdés-Szekeres 
theorem is true. Is there a least integer n(k) such that any set of n(&) points in 
general position in the plane contains a k-hole? He pointed out that n(4) = 5 
and Harborth [6] proved n(5) = 10. However, as Horton [7] shows, n(k) does 
not exist for k > 7. The question about the existence of n(6) (the empty- 
hexagon-problem) is still open. After a definition we formulate a conjecture 
which, if true, would imply that the number n(6) exists. 


Definition 3. Let A be a finite set of points in general position in the plane. 
Let k > 2,1 > 2. A subset S of A of size k +1 is called a (k,l)-set if S is 
a union of two disjoint sets K and L so that the following three conditions 
hold: 


() |K| =k, |Ll=1, 
(ti) K and L are mutually avoiding, 
(itt) the convex hull of S contains no points of A— S. 
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Conjecture 1 (Bardny, Valtr). For any two integers k > 2 andl > 2, 
there is an integer p(k,1) such that any set of at least p(k, 1) points in general 
position in the plane contains a (k,1)-set. 


If Conjecture 1 is true for & = | = 6 then the number n(6) exists. It 
follows from the fact that any (6,6)-set contains a 6-hole (it can be shown 
that either one of the corresponding sets K and L is a 6-hole or there is a 
6-hole containing three points of K and three points of L). 

Note that all known constructions of large sets with no 6-hole (see [7, 9]) 
satisfy Conjecture 1 already for rather small integers p(k, 1). 

We cannot even prove that the numbers p(k, 2), k > 5 exist. (Note that 
p(2,2) = 5 and p(3,2) = 7 are the minimum values of p(k, 2), k = 2,3, for 
which Conjecture 1 holds.) The existence of numbers p(k, 2), k > 2 would 
imply the following conjecture: 


Conjecture 2 (Baradny, Valtr). For any integer k > 0, there is an integer 
R(k) such that any set of at least R(k) points in general position in the 
plane contains k +2 points x,y, 21, 22,.--,2h such that the k sets {x, y, z:}, 
i=1,...,k are 3-holes (i.e., they form empty triangles). 


Barany [2] proved that Conjecture 2 holds for k < 10. 
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